This report develops the theoretical foundation for
analytical description and quantification of habitat structure.
The methodology is based on the concept of environmental
gradient analysis. The analytical description of environmental
gradients is shown to be an eigenanalysis problem,
mathematically equivalent to finding the eigenvector associated
with the largest eigenvalue (the first principal component) of a
principal components analysis. The analytical representation of an
environmental gradient, itself a single variable, is empirically
demonstrated to carry ecological information similar to that of the
combination of all the original 58 habitat variables describing
five Mojave Desert study sites.

Two vastly different databases were analyzed to explore
the effects of sample sizes and variable selection on the
ordination of study sites in both principal components and
canonical variate space. Merits and shortcomings of principal
components analysis, canonical analysis of discriminance,
and cluster analysis for the ordination and classification
of samples are reviewed in detail. An optimal "recipe"
is developed for ordinating habitat samples along the major
environmental (disturbance) gradient.

Canonical variate ordinations were shown to be sensitive
to sample sizes, the choice of variables, and linear dependency
among variables. Canonical analysis of discriminance
is a very effective mechanism for classifying samples into a
priori established groups, or for identifying variables that
contribute significantly to group discrimination.

Cluster  analysis  proved  to  be  an  excellent  technique  for 
exploring  the  effectiveness  of  variable  subsets  in  classifying 
samples  into  a  priori  established  groups.  The  R-mode  of 
cluster analysis is recommended for identifying independent
variable subsets and eliminating redundant variables.

ENVIRONMENTAL GRADIENT ANALYSIS, ORDINATION, AND
CLASSIFICATION IN ENVIRONMENTAL IMPACT ASSESSMENTS


1 INTRODUCTION


Background 

Traditionally, the physical structure (physiognomy) of vegetation has been
regarded as the most important habitat descriptor, especially for
understanding the structure and species richness of bird communities.
Previous studies (70, 80, 52, 60, 91, 2, 106, 119, 111, 112, 86, 58, 73, 16)*
have emphasized the importance of foliage height diversity, vertical layers
of vegetation, horizontal patchiness (habitat mosaics, including the
presence of habitat edges), and optimal "protected" nest sites (e.g.,
spinescent shrubs; tree, snag, and cactus boles).

Plant  species  compositions  (floristics)  have  also  explained  the  makeup  of 
avian  communities  (46,  6,  118,  69,  32,  50,  54,  114).  Since  a  plant  species 
may  have  a  specific  physical  structure,  it  is  generally  difficult  to  uncouple 
species  composition  and  physical  structural  components.  However,  specific 
plant  species  often  show  sufficiently  distinct  resources  (e.g.,  fruit,  nuts, 
seeds,  nectar,  pollen,  specific  nest  sites)  that  the  value  of  these  resources 
can  be  quantified.  Avian  researchers  have  generally  overlooked  plant  species, 
in  terms  of  the  arthropod  communities  they  harbor,  as  specific  food  resources 
for  birds.  Arthropod  species  richness,  taxonomic  compositions,  numerical 
abundances,  and  body  size  distributions  may  depend  more  on  the  taxonomic 
composition  of  the  vegetation  than  on  vegetation  density,  volume,  or 
structural complexity. Furthermore, the evolution of plant toxins and that
of the corresponding arthropod herbivores are closely interrelated (10, 24,
97, 27, 28, 29, 68, 81, 38, 85, 8). However, Jermy (57) presents evidence that
strongly  challenges  traditional  theories  of  the  coevolution  of  plant  defense 
toxins  and  insect  herbivory.  Plants'  chemical  defenses  influence  arthropod 
communities  and  therefore  avian  communities  in  two  important  ways.  First, 
plant  species  that  have  toxins  to  deter  herbivores  may  harbor  fewer  numbers 
and/or  smaller  body  sizes  of  arthropods.  Theoretically,  these  plants  should 
maintain  fewer  taxa  of  herbivores,  since  specialization  is  required  to 
detoxify  chemical  defenses.  Second,  at  least  some  of  the  arthropod  fauna  may 
be  unpalatable  to  avian  predators. 

Both  leaf  morphology  and  the  arrangement  of  leaves  on  branches  correlate 
with  avian  foraging  strategies,  and  therefore  represent  species-specific 
resources  (51).  Bark  surface  morphology  (55,  56,  107)  and  twig  diameter  (77) 
also  determine  species-specific  foraging  on  bole  and  limb  surfaces. 

These examples show that many environmental features, all potentially
important and some very critical, underlie habitat characterization.
Describing habitats in an analytical framework, as well as specifically
identifying the relative importance of habitat components to the community
structure of vertebrate faunas, challenges experimental designs, sampling
theory, data analysis, and ecological interpretation. An even stronger
challenge faces the synthesis of ecological theory and land, resource, and
wildlife management.

*Numbers in parentheses refer to references provided on pp 107-114.

Research that focuses on understanding the complexity of interrelationships
between species and their habitat requirements is necessary for
maintaining,  restoring,  and  managing  Army  lands  to  optimize  training  realism, 
conserve  natural  resources,  maintain  ecosystem  integrity,  and  comply  with 
Federal  and  State  mandated  environmental  regulations.  Previous  research  by 
the  U.S.  Army  Construction  Engineering  Research  Laboratory  (USA-CERL)  at  the 
Army  National  Training  Center,  Fort  Irwin,  CA,  collected  data  on  the  effects 
of  large-scale  Army  training  maneuvers  and  war  games  on  the  surrounding  desert 
ecosystem  (64).  This  study  also  developed  analytical  methodologies  for 
quantifying  environmental  impact  assessments,  and  analytically  described  and 
summarized  species-habitat  associations.  The  information  obtained  will  be 
used  as  guidelines  for  managing  desert  nongame  species  and  ecological 
communities  at  Army  installations. 

An important basis for the previous USA-CERL study (64) was the concept
of environmental gradients, which ranks impacts to provide a base for
determining land and species disturbances. The information in the current report
outlines  the  theoretical  and  analytical  framework  used  for  the  previous  study. 


Objective 

The objectives of this report were to: (1) present the concept of
environmental gradients and evaluate its potential for environmental impact
assessment, (2) develop an analytical methodology for describing environmental
gradients, (3) explore the sensitivity of ecological ordinations to sample
sizes and variable selection, and (4) explore analytical methodologies for
classifying environmental/ecological samples.


Scope 

A  consequence  of  this  research  is  the  analytical  and  unbiased  description 
of  habitat  structure  in  terms  of  physical  and  vegetation  characteristics.  In 
some instances, chemical parameters may be important: pH, salinity,
alkalinity, oxygen tension, micro- and macronutrients, allelopathics,
pesticides, toxins, and heavy metal compositions and concentrations. However,
even in these circumstances, vegetation and soils will usually be closely
correlated with the chemical environment in terrestrial as well as
riparian ecosystems. The chemical environment plays a more immediate and
direct  role  in  the  spatial  and  temporal  organization  of  aquatic  ecosystems. 
Chronic  dose  concentrations  of  natural  or  anthropic  elements  or  compounds  will 
not  be  considered  as  part  of  the  habitat  Gestalt. 


Approach 

Two  databases  (Mojave  Desert  and  Illinois)  were  selected  to  explore  the 
effects  of  sample  sizes  and  variable  selection  on  study  site  ordinations  for 
gradient characterization. Variables for quantifying habitat characteristics
were chosen for the two database locations. Data were gathered for each of
the variables and subjected to multivariate statistical analysis. Analytical
results were then examined, and the advantages and disadvantages of each
statistical analysis method used were noted.

Mode of Technology Transfer

This  research  contributes  to  the  fundamental  understanding  of  Army 
training  impacts  and  their  quantitative  assessment  and  description.  It  is 
recommended  that  the  results  be  used  to  develop  more  effective  impact 
prediction  methods,  land  maintenance  technologies,  and  analytical  approaches 
to  resource  and  wildlife  management. 


2  THEORETICAL  BASIS  FOR  CHARACTERIZING 
ENVIRONMENTAL  GRADIENTS 


Environmental  Gradients 

In any kind of environmental assessment or evaluation, a series of "study
sites" is selected that represents an ordinal ranking of impacts,
disturbances, or recovery, including controls. This means that, based on
visual perception and experience, an ecologist, land manager, or concerned
citizen identifies land values on an environmental ranking (an environmental
gradient). The two most familiar examples of these gradients are
environmental disturbance and successional gradients. After sites are
identified, experience and the literature dictate the choice of sampling
strategies and which habitat (environmental) variables to measure. Given that
information, samples of this specific nature must exhibit their greatest
intersite variance (variability) on habitat-variable sets that closely
associate with habitat description. What is the optimal set of habitat
variables, and the relative importance of each variable, that best quantifies
or describes habitat structure in the context of ordinal environmental
gradients? This report derives the mathematical form of environmental
gradients and shows it to be a generalized eigenanalysis solution of the
well known principal components* solution.
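The eigenanalysis view of a gradient can be illustrated with a minimal sketch. It assumes a small, made-up site-by-variable data set (the numbers, variable names, and function names are hypothetical and do not come from this report). The variables are standardized, their correlation matrix is formed, and the dominant eigenvector is extracted by power iteration. Site scores on that vector give a single-variable ordering of the sites.

```python
import math

def standardize(columns):
    """Center each variable and scale it to unit sample standard deviation."""
    out = []
    for col in columns:
        n = len(col)
        mean = sum(col) / n
        sd = math.sqrt(sum((v - mean) ** 2 for v in col) / (n - 1))
        out.append([(v - mean) / sd for v in col])
    return out

def correlation_matrix(z):
    """Correlation matrix of standardized variables (given as a list of columns)."""
    n = len(z[0])
    return [[sum(a * b for a, b in zip(zi, zj)) / (n - 1) for zj in z] for zi in z]

def first_eigenvector(matrix, iterations=500):
    """Dominant eigenvector and eigenvalue of a symmetric matrix by power iteration."""
    p = len(matrix)
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        w = [sum(matrix[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * sum(matrix[i][j] * v[j] for j in range(p))
                     for i in range(p))
    return v, eigenvalue

# Hypothetical data: three habitat variables at five sites, listed from
# most to least disturbed (illustrative numbers only).
shrub_density = [120.0, 310.0, 540.0, 800.0, 860.0]
shrub_cover = [1.5, 4.0, 7.5, 11.0, 12.5]
bare_sand = [82.0, 70.0, 55.0, 41.0, 38.0]

z = standardize([shrub_density, shrub_cover, bare_sand])
R = correlation_matrix(z)
weights, eigenvalue = first_eigenvector(R)

# Site scores on the first principal component: one number per site.
scores = [sum(w * zc[i] for w, zc in zip(weights, z)) for i in range(5)]
```

In this toy case the three variables are almost perfectly correlated, so the first component carries nearly all of the variance and the scores reproduce the disturbance ranking of the sites.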


Canonical Analysis of Discriminance

Terminology in discriminant analyses has been used inconsistently and
inappropriately. According to Pimentel (78), the analysis used here is
canonical analysis of discriminance (CAD). This analysis weights predictor
(habitat) variables such that their linear combinations maximally distinguish
(discriminate) among two or more predetermined groups or classes. The F-ratio
is the criterion for measuring class differences (65, 79). Basically,
$F = SS_b/SS_w$ (between-group over within-group sums of squares). By
rewriting the sums of squares terms in the form of vectors of linear
combinations of predictor variables, the matrix form reduces to:

$$\frac{v'Bv}{v'Wv} = \lambda \qquad \text{(see Eq 37)}$$

Fisher (31) was the first to propose $\lambda$ as the discriminating
criterion. The discriminant problem reduces to extracting the set of weights
that maximizes $\lambda$. This is accomplished by the method of Lagrange
multipliers (see Eq 38). For more details, refer to Tatsuoka (102), Pimentel
(78), or Dillon and Goldstein (17). Tatsuoka (101) presents an unusually
elementary and lucid introduction to discriminant analysis. Anderson (4) and
Seber (90) provide advanced treatments.
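The criterion v'Bv/v'Wv can be made concrete with a short sketch. It is restricted, as an assumption for brevity, to two groups and two habitat variables, where the maximizing weights are known in closed form to be proportional to the inverse of W times the difference of the group means. The data and all function names are hypothetical. The general multi-group case requires a full eigenanalysis and is not shown.

```python
def mean_vector(rows):
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]

def within_sscp(groups):
    """Pooled within-group sums of squares and cross-products matrix W (2 x 2)."""
    W = [[0.0, 0.0], [0.0, 0.0]]
    for rows in groups:
        m = mean_vector(rows)
        for r in rows:
            d = [r[0] - m[0], r[1] - m[1]]
            for i in range(2):
                for j in range(2):
                    W[i][j] += d[i] * d[j]
    return W

def between_sscp(groups):
    """Between-group sums of squares and cross-products matrix B (2 x 2)."""
    grand = mean_vector([r for rows in groups for r in rows])
    B = [[0.0, 0.0], [0.0, 0.0]]
    for rows in groups:
        m = mean_vector(rows)
        d = [m[0] - grand[0], m[1] - grand[1]]
        for i in range(2):
            for j in range(2):
                B[i][j] += len(rows) * d[i] * d[j]
    return B

def quad(v, M):
    """Quadratic form v'Mv."""
    return sum(v[i] * M[i][j] * v[j] for i in range(2) for j in range(2))

def criterion(v, B, W):
    """Discriminant criterion lambda = v'Bv / v'Wv."""
    return quad(v, B) / quad(v, W)

def fisher_weights(groups):
    """Two-group solution: v proportional to W^-1 (m1 - m2)."""
    W = within_sscp(groups)
    m1, m2 = mean_vector(groups[0]), mean_vector(groups[1])
    d = [m1[0] - m2[0], m1[1] - m2[1]]
    det = W[0][0] * W[1][1] - W[0][1] * W[1][0]
    return [(W[1][1] * d[0] - W[0][1] * d[1]) / det,
            (-W[1][0] * d[0] + W[0][0] * d[1]) / det]

# Hypothetical samples: [shrub cover (%), bare sand (%)] per sampling unit.
impacted = [[2.0, 60.0], [3.0, 55.0], [2.5, 65.0], [4.0, 58.0]]
control = [[9.0, 30.0], [11.0, 35.0], [10.0, 28.0], [12.5, 33.0]]
groups = [impacted, control]

B, W = between_sscp(groups), within_sscp(groups)
v = fisher_weights(groups)
lam = criterion(v, B, W)  # no other weight vector gives a larger ratio
```

Because the criterion is a ratio, any nonzero multiple of `v` gives the same value of `lam`; only the relative sizes of the weights matter.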

Discriminant analyses appear to be potentially valuable tools, since the
relative importance or lack of significance of each habitat variable can be
evaluated according to its ability to separate treatments (impacts) from
controls. However, discriminant analyses pose problems.


*A dimension reduction ordination technique.

Discriminant analyses have many undesirable practical features, despite
their theoretical advantages and widespread use by researchers. Several
publications (66, 108, 62, 110, 115, 116, 4, 90) discuss the shortcomings of
discriminant analyses concerning statistical assumptions, sampling
variability, and analysis interpretation. Although they are frequently used in
ecological studies, stepwise discriminant procedures have been criticized,
since many environmental variables are usually highly correlated (25, 40).
Seber  (90)  presents  a  good  review  of  variable  selection  in  discriminant 
analysis  and  emphasizes  that  stepwise  procedures  strongly  depend  on  the 
assumptions of normality and homogeneous dispersion matrices. The most
commonly used statistical packages employing discriminant procedures for
selecting variable subsets also depend on these assumptions (42). Not only
are these assumptions difficult to test, but they may not even be tenable.
Furthermore, ecological data, even transformed, rarely meet the above
assumptions. Hawkins (47) has developed a procedure to test simultaneously for
homogeneity  in  the  covariance  matrices  and  multivariate  normality,  based  on 
the  Anderson-Darling  test  statistics  (22).  However,  my  experience  has  been 
that  stepwise  discriminant  analyses  are  totally  inappropriate  for  the 
selection  of  relevant  and  interpretable  variables,  unless  the  underlying 
ecological  pattern  is  very  simple  and  a  relatively  small  number  of  independent 
(predictor)  variables  are  involved. 

The procedure for selecting variable subsets in this research was a
two-level process. First, variables were identified by their ability to
quantify the major environmental gradient present in the study sites. The
relative quantitative value of predictor variables was determined by a priori
orthogonal contrasts, an ANOVA routine (64). Second, various logical subsets
of these variables (based on experience) were subjected to CAD. Inspection of
the results of these analyses clearly identified the subset of variables that
not only defined a prominent environmental impact gradient on the first
canonical variate axis and produced a good separation of study sites in
canonical space, but was also ecologically interpretable on the basis of
first-hand field experience.

The number of discriminating variables is limited to the number of
sampling units. Generally, this is not a problem, unless the researcher
samples only a few transects or quadrats.

Cluster  Analysis 

Cluster analysis is a powerful analytical tool for organizing samples,
observations, or objects into groups based on predictor (independent)
variables (1, 94, 45, 26, 21, 67, 84, 90). Clustering procedures are generally
hierarchical. In other words, the resulting classification produces ranked
classes where the members of inferior ranking clusters become members of
larger higher-ranking clusters. The ranking produces the well known
dendrogram tree. This is a logical procedure for taxonomic data and for
visualizing the underlying pattern of ranked relationships. Nonhierarchical
methods are potentially useful in ecology (67), but are more difficult to
compute and not readily available in data analysis software packages. These
methods optimize within-group homogeneity, rather than hierarchical routes
between objects and major clusters. Objects are clustered with groups only on
the basis of their direct relationships, without regard to hierarchical
affinities.

There are two approaches for hierarchical classification of n objects.
Using  divisive  methods,  n  objects  begin  in  a  single  group  and  are  separated 
into  two  groups,  which  in  turn  are  separated  into  four,  etc.,  until  only  a 
pair  of  objects  remain.  Divisive  methods  may  be  monothetic  or  polythetic. 
Monothetic  algorithms  consider  a  single  descriptor  variable  at  a  time.  This 
may  lead  to  erroneous  classifications  (39a,  15).  Polythetic  algorithms  use  an 
association matrix that considers the complete combination of many
variables. However, for realistic data (even when the number of objects and
predictor  variables  is  moderate)  polythetic  methods  require  unrealistic 
computer  effort  (67).  Seber  (90)  discusses  advantages  and  disadvantages  of 
divisive  techniques.  The  other  approach  (agglomerative)  begins  by  examining 
all  individual  objects  and  pairs  the  two  most  similar,  then  the  next  pair  that 
is  most  similar,  merging  groups  accordingly  until  a  single  group  comprising  n 
objects  is  formed.  Agglomerative  methods  are  much  more  widely  used  and  highly 
recommended  (94,  67,  84).  Table  7.9  of  Legendre  and  Legendre  (67)  provides  a 
useful  synoptic  summary  of  clustering  methods. 
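The agglomerative procedure described above can be sketched in a few lines. This is a minimal illustration, assuming Euclidean distance and unweighted average linkage (UPGMA) computed directly from its definition. The sample coordinates and function names are hypothetical, and the brute-force search is only practical for small n.

```python
import math

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def upgma(points):
    """Agglomerative clustering with unweighted average linkage (UPGMA).

    Returns the merge history as (members_a, members_b, distance) tuples.
    """
    clusters = [(i,) for i in range(len(points))]
    history = []

    def average_distance(c1, c2):
        total = sum(euclidean(points[i], points[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        # Find the closest pair of current clusters.
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = average_distance(clusters[a], clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        history.append((clusters[a], clusters[b], d))
        merged = tuple(sorted(clusters[a] + clusters[b]))
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return history

# Hypothetical samples described by two habitat variables.
samples = [(1.0, 1.0), (1.2, 0.9), (5.0, 5.1), (5.2, 4.9), (9.0, 1.0)]
merges = upgma(samples)
```

Reading `merges` from first to last entry gives the dendrogram from the tips to the root: n objects always produce n - 1 merges, and the last merge joins all objects into a single group.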


There  are  six  commonly  used  hierarchical  agglomerative  clustering 
algorithms  for  mainframe  computers:  (1)  the  unweighted  pair-group  method  that 
uses  arithmetic  averages  (UPGMA),  (2)  the  weighted  pair-group  method  that  uses 
arithmetic  averages  (WPGMA),  (3)  the  single-linkage  clustering,  (4)  the 
complete  linkage  clustering,  (5)  the  centroid  method,  and  (6)  Ward's  method 
(NTSYS,*  CLUSTAN  [120],  SAS  [87],  BMDP  [18],  SPSS*  [98]).  The  first  five  of 
these  algorithms  actually  fit  a  single  general  model,  and  therefore,  a  single 
computer  program  can  be  used  where  four  parameters  are  varied  to  select  the 
desired  algorithm  (66a,  66b).  Algorithms  are  executed  on  a  similarity  or 
resemblance  matrix.  This  matrix  consists  of  coefficients  (similarity, 
dissimilarity,  distance,  etc.)  that  reflect  the  degree  of  association  (or 
dissimilarity)  among  all  possible  pairs  of  the  samples,  observations,  or 
objects  to  be  classified.  There  is  a  very  wide  variety  of  resemblance 
coefficients  in  use,  including  various  correlation  coefficients,  cosines  of 
variable  vectors,  and  distance  measures.  Common  distance  measures  include 
Euclidean  distance  (usually  its  square),  Manhattan  metric,  and  arccosine  of 
the  correlation.  The  literature  provides  discussions  and  suggestions  for 
selecting  and  using  resemblance  coefficients  (92,  14,  1,  94,  83,  15,  19,  45, 
7,  9,  26,  21,  67,  84).  The  variables  or  attributes  describing  the  samples  may 
be measured on nominal, ordinal, ratio, or interval scales. Counts,
frequencies, or even presence/absence binary data are all equally appropriate. At
least  12  different  qualitative  resemblance  coefficients  have  commonly  been 
used  for  presence/absence  data.  Cluster  analyses  are  routinely  conducted  that 
mix  several  variable  scales.  It  is  common  practice  in  cluster  analysis  to 
transform  and/or  standardize  raw  data  values. 
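A four-parameter general model of this kind is widely known as the Lance-Williams update formula; the sketch below assumes that this is the model the text refers to. When clusters i and j merge, the distance from any other cluster k to the new cluster is computed from the three old distances using the parameters alpha_i, alpha_j, beta, and gamma. The parameter values are the standard ones for the first five algorithms listed above; the function names and example distances are hypothetical.

```python
def lance_williams_params(method, n_i, n_j):
    """Return (alpha_i, alpha_j, beta, gamma) for merging clusters i and j."""
    total = n_i + n_j
    if method == "single":
        return 0.5, 0.5, 0.0, -0.5
    if method == "complete":
        return 0.5, 0.5, 0.0, 0.5
    if method == "wpgma":
        return 0.5, 0.5, 0.0, 0.0
    if method == "upgma":
        return n_i / total, n_j / total, 0.0, 0.0
    if method == "centroid":  # expects squared Euclidean distances
        return n_i / total, n_j / total, -(n_i * n_j) / total ** 2, 0.0
    raise ValueError("unknown method: " + method)

def updated_distance(method, d_ki, d_kj, d_ij, n_i=1, n_j=1):
    """Distance from cluster k to the cluster formed by merging i and j."""
    a_i, a_j, beta, gamma = lance_williams_params(method, n_i, n_j)
    return a_i * d_ki + a_j * d_kj + beta * d_ij + gamma * abs(d_ki - d_kj)

# Example distances from cluster k to clusters i and j, and between i and j.
d_ki, d_kj, d_ij = 9.0, 13.0, 4.0

d_single = updated_distance("single", d_ki, d_kj, d_ij)      # smaller of the two
d_complete = updated_distance("complete", d_ki, d_kj, d_ij)  # larger of the two
d_wpgma = updated_distance("wpgma", d_ki, d_kj, d_ij)        # simple average
d_upgma = updated_distance("upgma", d_ki, d_kj, d_ij, n_i=3, n_j=1)
d_centroid = updated_distance("centroid", d_ki, d_kj, d_ij)
```

The practical benefit is that only the resemblance matrix has to be stored and updated after each merge; the raw data are not needed again.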


Cluster analysis is a very useful classification technique for grouping
observations, samples, species, study sites, objects, etc. (e.g., dependent
or criterion variables) on the basis of independent or predictor variables
(e.g., environmental or morphological variables). This is the most
frequent use of cluster analysis, and is termed Q-analysis. The independent
or "description" variables can be clustered similarly on the basis of
dependent or sample variables. This is known as R-analysis. R-analysis is
useful for reducing the dimensionality of experimental data sets.
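As a rough illustration of R-analysis, the sketch below groups variables rather than samples. It assumes the absolute Pearson correlation as the resemblance coefficient and a single-linkage rule cut at a threshold of 0.9; both choices, the data, and the names are hypothetical rather than the settings used in this report. Variables that fall into the same group are candidates for redundancy.

```python
import math

def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def group_variables(data, threshold=0.9):
    """Single-linkage grouping: link two variables when |r| >= threshold."""
    names = list(data)
    group_of = {name: idx for idx, name in enumerate(names)}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if abs(pearson(data[a], data[b])) >= threshold:
                # Merge the group containing b into the group containing a.
                old, new = group_of[b], group_of[a]
                for name in names:
                    if group_of[name] == old:
                        group_of[name] = new
    groups = {}
    for name in names:
        groups.setdefault(group_of[name], []).append(name)
    return list(groups.values())

# Hypothetical habitat variables measured on six sampling units.
habitat = {
    "shrub_density": [120.0, 310.0, 540.0, 800.0, 860.0, 400.0],
    "shrub_cover": [1.5, 4.0, 7.5, 11.0, 12.5, 5.5],
    "rock_cover": [12.0, 3.0, 9.0, 5.0, 14.0, 7.0],
}
variable_groups = group_variables(habitat)
```

Here shrub density and shrub cover end up in one group, so one of them could be dropped with little loss of information, while rock cover stands alone.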


The  major  advantages  of  cluster  analysis  are  its  inherent  simplicity  and 
its  very  broad  applicability  to  any  kind  of  data,  sample  sizes,  etc.  However, 
there  are  drawbacks.  The  researcher  must  choose  from  among  a  variety  of 
clustering  algorithms,  and  each  method  usually  yields  different  (sometimes 
very  different)  results.  There  is  an  even  larger  choice  when  constructing  a 
resemblance  matrix.*  Again,  the  results  are  sensitive  to  the  similarity  or 
distance  coefficients  employed.  There  will  probably  never  be  a  "correct"  or 
optimal  methodology  for  cluster  analysis  (117,  76).  Most  statisticians  agree 
that  in  cluster  analysis  statistical  inference  or  hypothesis  testing  may  be 
tenuous.  However,  Dillon  and  Goldstein  (17)  and  Romesburg  (84)  do  not  agree 
with  this  consensus. 

An excellent application of cluster analysis is exploring the
effectiveness of independent variable subsets in classifying entities of
known group affiliation, as done in this study. Here, the choice of a
clustering algorithm and the form of the resemblance matrix were considered
very carefully on the basis of previous experience and the literature (82,
93, 94, 67, 84, 90). (Chapter 4 provides details.)

Cluster  analysis  proved  highly  beneficial,  and  can  be  recommended  for 
exploring  the  effectiveness  of  various  independent  variables  to  classify 
sampling  units  of  known  group  affiliation. 

Cluster  analyses  using  the  Mojave  Desert  data  set  (described  in  Chapter 
4)  were  performed  to  contrast  this  technique  with  principal  components  and 
canonical variate ordinations, and to illustrate some features of
eigenanalysis and discriminant analysis.

Cluster analysis was also used to empirically demonstrate that the use of
a  reduced  n  x  4  factor  pattern  matrix  produces  rotated  principal  components 
solutions  that  are  ecologically  more  realistic  and  interpretable  than 
solutions  derived  from  the  full  n  x  n  matrix. 


*The resemblance matrix is developed from the raw data matrix and consists of
similarity, dissimilarity, distance, or correlation coefficients, directional
cosines, or some quantifiable measure of the "resemblance" or association of
data matrix elements. The resemblance matrix is the input for the clustering
algorithm.


3  DATABASE  SELECTION 


Two  vastly  different  databases  were  analyzed  to  explore  the  effects  of 
sample  sizes  and  variable  selection  on  the  ordination  of  study  sites  in  both 
canonical  variate  and  principal  components  space.  The  importance  of  this 
evaluation cannot be overemphasized. In the literature, researchers initiate
important management decisions on the basis of inferences from their results
without reference to the process of variable selection (e.g., they do not
state how or which variables were selected for their analyses). Thus, it is
not clear whether their results, statistical inferences, or ecological
interpretations were sensitive to variable selection.

For  this  study,  the  two  databases  used  for  analytical  habitat  description 
were:  (1)  23  linear  transects  associated  with  Army  training  impacts  at  Fort 
Irwin,  California,  in  the  central  Mojave  Desert  (35°  38'  -  35°  8'  N,  116°  56' 
- 116° 19' W) and (2) 369 square quadrats associated with strip-mine
succession in east central Illinois.

The  central  Mojave  Desert  consists  of  broad  valley  plains  between  rugged 
mountain  ranges  and  occasional  high  plateaus  and  dry  lake  beds  (playas).  Away 
from  the  mountains  are  alluvial  fans  or  bajadas  (coalesced  fans)  that 
gradually  slope  into  the  valleys.  Alluvial  substrates  are  gravelly  or  rocky 
since  they  are  formed  by  the  erosional  breakdown  of  the  mountains.  Boulder 
fields  or  large  rubble  can  be  found  at  the  base  of  the  mountains.  Washes 
occur  throughout  the  valleys,  bajadas,  and  alluvial  fans. 

USA-CERL Technical Report N-85/13 (64) provides map coordinates and
elevations  for  all  of  the  Mojave  Desert  study  sites.  Appendix  A  provides 
general  ecological  characteristics  of  the  study  sites. 

The nine sites in Illinois, called B, C, D, E, F, G, H, I, J, were all
within  5  km  of  one  another,  and  were  located  in  Kickapoo  State  Park  and 
Leisure  Time  Estates  in  central  Vermilion  County,  east  central  Illinois,  20  km 
west  of  the  Indiana  state  line.  Another  site  (site  A),  a  University  of 
Illinois  natural  area  (Phillips  Tract)  was  located  in  Champaign  County,  35  km 
west  of  the  other  sites.  Appendix  D  gives  general  ecological  characteristics 
and  sizes  of  the  Illinois  study  sites.  Most  of  the  study  sites  were  10  ha. 

Each  transect  or  quadrat  in  the  two  areas  will  be  referred  to  as  a 
sampling  unit.  The  Illinois  data  set  is  much  larger,  since  it  has  16  times 
more  sampling  units.  However,  the  Mojave  data  set  is  of  "higher  statistical 
quality."  This  terminology  is  not  meant  to  imply  better  or  more  accurate 
data,  but  is  based  on  two  factors:  (1)  sample  sizes  to  obtain  habitat 
parameters  within  each  Mojave  transect  were  usually  larger  than  those  used  to 
obtain  comparable  parameters  in  the  Illinois  quadrats,  and  (2)  the  size  of 
the  sampling  area  used  to  measure  habitat  variables  was  57  times  larger  on  the 
transect  sampling  units  (16  ha)  than  on  the  quadrat  sampling  units  (0.28 
ha).  Therefore,  by  sampling  more  intensely  and  by  covering  a  much  larger  area 
to  obtain  these  samples,  the  estimated  habitat  parameters  should  more 
accurately  represent  the  entire  study  site. 
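The two ratios quoted above follow directly from the figures in the text, as the short check below shows. The total-area comparison at the end is an additional derived quantity, not a figure stated in the report.

```python
mojave_units = 23          # transects (from the text)
illinois_units = 369       # quadrats (from the text)
transect_area_ha = 16.0    # sampling area per Mojave transect
quadrat_area_ha = 0.28     # sampling area per Illinois quadrat

unit_ratio = illinois_units / mojave_units        # about 16
area_ratio = transect_area_ha / quadrat_area_ha   # about 57

# Derived here for illustration: total area sampled under each design (ha).
mojave_total_ha = mojave_units * transect_area_ha      # 368.0
illinois_total_ha = illinois_units * quadrat_area_ha   # about 103.3
```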

These two experimental designs address an interesting question. If two
sampling designs have equal intensities of sampling, but one design uses
many small plots with few samples per plot and the other uses larger
plots with large numbers of samples per plot, will the multivariate
representations in both designs be comparable? This report addresses this problem
from  the  statistical  viewpoint.  However,  this  question  may  be  more  relevant 
to  biological/ecological  reality  than  to  statistical  theory.  Several  examples 
will  clarify  this  point. 

The predominant habitat component of the Mojave Desert is the
three-dimensional distribution of creosote bush (Larrea tridentata). Burroweed
(Ambrosia dumosa) density or cover and substrate particle size are other
important components. Therefore, the habitat structure is relatively simple,
and is very similar from place to place (low α diversity). Extreme aridity
results  in  low  spatial  and  temporal  productivity,  so  vegetation  is  sparse  and 
patchy  on  a  scale  of  10  to  50  m.  Therefore,  fewer  but  larger  sampling  units 
are  necessary. 

In contrast, regions with higher rainfall have vegetation that is not only
denser but also much more complex in structure, with a variety of trees,
shrubs, vines, and ground cover. A high degree of vegetation complexity
increases the potential for high intrasite variability (high α diversity).
For example, a
large  windblown  tree  opens  the  canopy  in  an  eastern  forest,  and  blackberry, 
raspberry,  or  rose  thickets  develop  along  with  grape  vines.  Similarly,  during 
a  series  of  unusually  wet  years,  local  depressions  in  a  forest  may  collect 
enough  water  to  saturate  the  soil  and  kill  flood-intolerant  tree  species,  thus 
producing  snags.  Both  of  these  patches  in  the  forest  represent  ecological 
opportunities  (food  and  nest  sites)  that  may  be  unavailable  or  rare  in  the 
closed  canopy  forest  interior.  Therefore,  when  habitat  complexity  is  greater, 
sampling  must  be  done  on  a  smaller  scale  to  quantify  habitat  mosaics, 
particularly  if  an  important  feature  of  the  research  is  to  decipher  species- 
habitat  relationships.  Furthermore,  if  primary  productivity  is  high  and  the 
vegetation  dense,  smaller  sample  sizes  may  be  sufficient  for  quantification. 
Therefore,  assuming  that  time  and  manpower  are  limited,  as  they  always  are, 
specific  experimental  designs  for  habitat  quantification  are  a  function  of 
habitat  complexity  or  primary  productivity. 


4 DESCRIPTION OF SAMPLING VARIABLES AND
DATA  COLLECTION  AND  ANALYSIS  METHODS 


This  chapter  describes  the  variables  chosen  for  quantifying  habitat 
characteristics  at  each  study  site.  Also  described  are  the  methods  used  to 
gather  data  for  each  variable  and  the  data  analysis  methods. 


Mojave  Desert  Study  Sites  and  Habitat  Variables 

Five  study  sites  were  located  at  Fort  Irwin.  These  sites  were  chosen  to 
represent  three  levels  or  degrees  of  habitat  disturbance  (severely  impacted, 
moderately  impacted,  lightly  impacted)  and  two  unimpacted  control  sites. 
Typically, vehicle traffic and Army training activities occur in the valleys,
and impacts decline as one progresses into the alluvials toward the
mountains. The severely impacted site (S) was in a broad valley about 600 m from
the main road. The moderately impacted site (M) was on the alluvial about 3
km  from  the  S  site.  Respective  control  sites  (valley  control  [VC]  and 
alluvial  control  [AC])  were  located  at  Goldstone,  an  unimpacted  portion  of 
Fort  Irwin,  leased  to  the  National  Aeronautics  and  Space  Administration/Jet 
Propulsion  Lab  for  tracking  deep  space  satellites.  A  fifth  study  site  (L)  was 
lightly  impacted  and  located  on  a  high  desert  plateau. 

Three  to  six  800-m-long  linear  transects  (sampling  units)  were  located  in 
each  of  the  five  study  sites  (Appendix  A,  N  =  23*).  Four  additional  transects 
for  sampling  habitat  variables  were  located  along  each  of  the  800-m  sampling 
units  (N  =  4  x  23  =  92).  These  habitat  transects  had  their  origins  at  the 
100-,  300-,  500-,  and  700-m  loci  of  the  800-m  transects.  The  compass  bearing 
of  the  habitat  transect  was  randomly  determined  by  casting  dice  and  employing 
a  technique  such  that  all  integers  from  0  through  359  had  an  equal  probability 
of  occurrence.  Habitat  transects  were  100  x  4  m  (50-  x  4-m  transects  were 
used  when  shrub  density  was  high).  Each  woody  plant  (height  >  0.2  m)  whose 
center  fell  within  the  transect  was  identified,  and  its  average  diameter 
(measured  at  its  maximum  horizontal  projection)  and  maximum  height  were 
recorded  to  the  nearest  0.1  m. 

Desert  shrub  individuals,  particularly  creosote  bush,  are  not  easily 
delimited  (23,  61).  Creosote  bush  scrub  consists  of  circular  or  semicircular 
bands  of  clonal  growth  patterns.  The  center  of  the  shrub  dies,  but  is 
replaced  on  the  periphery  by  new  stem  growth  from  the  roots.  The  resulting 
growth  pattern  is  a  ring  of  circular  or  elliptical  satellite  clumps  with  a 
barren  center  (99,  109).  Therefore,  delineation  of  shrub  individuals  was 
somewhat  subjective.  Shrubs  were  judged  as  individuals  when  their  foliage  was 
separated  by  at  least  10  cm  or  when  their  respective  radii  were  clearly 
outlined  (e.g.,  burroweed  foliage  occurs  in  dense,  compact,  spherical  clus¬ 
ters)  . 

Each vegetation transect was traversed, and every 2 m a sighting tube
with crosshairs was used to record 50 "ground hits" from the following
categories: grass, forb (nonwoody annual and perennial vegetation), litter
(dry and decaying plant material lying on the ground surface), sand (< 3 mm),
coarse sand (3 mm to < 1 cm), gravel (1 to < 8 cm), and rock (> 8 cm).


*Total of transects for all five sites was 23.


Shrub density was calculated as:

$$I_{xs} = \frac{10^{4}}{16 J_x} \sum_{j=1}^{J_x} \sum_{k=1}^{4} \frac{N_{xsjk}}{L_{xjk}}$$   [Eq 1]

where:

$I_{xs}$ = density of species (or height category) s at site x (shrubs/ha)

$J_x$ = number of sampling units at site x

$N_{xsjk}$ = number of individual shrubs of species s in the kth habitat
transect of the jth sampling unit at site x

$L_{xjk}$ = length of the kth habitat transect of the jth sampling unit
at site x.
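
As an illustration of Eq 1, the following minimal sketch computes shrub density from per-transect counts and transect lengths. The function name and the nested-list layout (one inner list of four habitat transects per sampling unit) are assumptions made for this example, not part of the original analysis.

```python
def shrub_density(counts, lengths):
    """Eq 1 sketch: density in shrubs/ha.

    counts[j][k]  = shrubs counted in habitat transect k of sampling unit j
    lengths[j][k] = length (m) of that habitat transect (transects are 4 m wide)
    Assumes four habitat transects per sampling unit, as in the text.
    """
    n_units = len(counts)
    ratio_sum = sum(
        n / length
        for unit_counts, unit_lengths in zip(counts, lengths)
        for n, length in zip(unit_counts, unit_lengths)
    )
    # 10^4 m2/ha divided by (4 m width x 4 transects x J sampling units)
    return (1e4 / (16 * n_units)) * ratio_sum
```

For example, one sampling unit with 10 shrubs in each of four 100-m transects gives 10 shrubs per 400 m2, or 250 shrubs/ha.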

Shrub cover was calculated as:

$$C_{xs} = \frac{10^{4}}{4} \cdot \frac{\sum_{i=1}^{N_{xs}} \pi (D_{xsi}/2)^{2}}{\sum_{j=1}^{J_x} \sum_{k=1}^{4} L_{xjk}}$$   [Eq 2]

where:

$C_{xs}$ = cover of species (or height category) s at site x (m2/ha)

$D_{xsi}$ = diameter (m) of the ith individual of species s at site x

$N_{xs}$ = sample size of species s at site x.

Ground cover was calculated as:

$$G_{xg} = \frac{1}{2 J_x} \sum_{j=1}^{J_x} \sum_{k=1}^{4} P_{xgjk}$$   [Eq 3]

where:

$G_{xg}$ = percent ground cover of g (grass, forbs, or litter) at site x

$P_{xgjk}$ = number of "hits" of g in the kth habitat transect of the jth
sampling unit at site x.
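
The factor 1/(2J) in Eq 3 follows from the sampling design: 50 hits per habitat transect and four transects per sampling unit give 200 hits per unit, so 100 x hits / (200 J) reduces to hits / (2 J). A minimal sketch, with a hypothetical function name and data layout:

```python
def ground_cover_percent(hits):
    """Eq 3 sketch: percent ground cover of one category g at a site.

    hits[j][k] = number of the 50 sighting-tube hits that fell on
    category g in habitat transect k of sampling unit j.
    """
    n_units = len(hits)
    total_hits = sum(sum(unit) for unit in hits)
    return total_hits / (2 * n_units)
```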

Substrate particle size distribution was calculated as:

$$S_{xp} = \frac{\sum_{j=1}^{J_x} \sum_{k=1}^{4} P_{xpjk}}{\sum_{p=1}^{4} \sum_{j=1}^{J_x} \sum_{k=1}^{4} P_{xpjk}}$$   [Eq 4]

where:

$S_{xp}$ = percent of surface substrate of particle size p at site x

$P_{xpjk}$ = number of "hits" of particle size p in the kth habitat
transect of the jth sampling unit at site x.


Horizontal  heterogeneity  was  defined  as  the  mean  within-transect  hetero¬ 
geneity  of  three  shrub  parameters:  diameter,  height,  and  density.  Within- 
transect  heterogeneity  was  the  coefficient  of  variation  calculated  from  the 
four  habitat  transects.  The  mean  from  all  800-m-long  sampling  units  at  a 
given  site  therefore  determined  horizontal  heterogeneity. 


Habitat heterogeneity was calculated as:

$$H_{xp} = \frac{1}{J_x} \sum_{j=1}^{J_x} \frac{SD_{xpj}}{\bar{X}_{xpj}}$$   [Eq 5]

where:

$H_{xp}$ = index of horizontal heterogeneity of parameter p

p = shrub diameter, height, or density

$SD_{xpj}$ = standard deviation of parameter p in the jth sampling unit at
site x

$\bar{X}_{xpj}$ = the mean of parameter p in the jth sampling unit at site x.

Appendix B lists all Mojave Desert habitat variables. Appendix C defines the
transformations for these variables.
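
The heterogeneity index of Eq 5 is the site mean of per-unit coefficients of variation. The sketch below assumes the sample (n - 1) standard deviation, which the text does not specify; the function name is hypothetical.

```python
from statistics import mean, stdev


def horizontal_heterogeneity(units):
    """Eq 5 sketch: mean coefficient of variation over sampling units.

    units[j] = the four habitat-transect values of one shrub parameter
    (diameter, height, or density) in sampling unit j.
    Assumes the sample standard deviation (n - 1 denominator).
    """
    cvs = [stdev(values) / mean(values) for values in units]
    return mean(cvs)
```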


Illinois  Study  Sites  and  Habitat  Variables 

Each  study  site  (with  the  possible  exceptions  of  A  and  F)  was  very 
carefully  selected  so  that  it  occupied  the  center  of  a  much  larger  area  that 
had  habitat  identical  to  the  respective  study  site.  Thus,  both  island  and 
edge  effects  were  eliminated.  All  sites  were  gridded  into  0.28-ha  square 
plots  (52.7  m  per  side).  Each  0.28-ha  plot  was  a  sampling  unit  (Figure  1).* 
Although  it  was  desirable  to  have  square  study  sites,  as  Figure  1  suggests, 
this  was  not  always  feasible  (e.g.,  bottomland  forest  sites).  A  circular  26- 
m-diameter  quadrat  (0.053  ha)  was  established  at  the  center  of  each  0.28-ha 
plot  to  quantify  habitat  structure.  The  method  for  quickly  delineating 
circular  quadrats  without  using  a  tape  measure  or  markers  is  given  in  James 
and  Shugart  (53).  The  following  sampling  scheme  was  used  in  each  26-m 
circular  quadrat.  At  each  of  the  20  sampling  points  "A"  and  "B"  (Figure  2),  a 
heavy  cardboard  tube  fitted  with  crosshairs  was  used  to  determine  the 
presence/absence  of  vegetation  for  ground  cover  (vegetation  <  1  m  high)  and 
canopy  cover  (vegetation  >  2  m  high).  To  eliminate  focus  bias,  the  ocular  was 
positioned  at  hip  level  or  above  the  head  before  looking  through  it.  A  hit 
was  scored  as  positive  if  vegetation  was  observed  at  the  intersection  of  the 
crosshairs.  Also  recorded  at  the  same  time  with  this  method  was  the 
presence/absence  of  aquatic  habitats  and  whether  the  ground  was  level  or 
sloped.  At  each  of  the  12  sampling  points  "B,"  the  presence/absence  of 
vegetation  at  11  vertical  levels  was  determined,  based  on  the  intersection  of 
vegetation  with  a  7.5-m  telescopic  measuring  rod.  Levels  greater  than  7.5  m 
were  estimated.  Appendix  E  gives  the  height  categories  used. 


*Tables and figures appear at the end of this report on pp 51-106. They
appear  in  the  same  order  in  which  they  are  referenced. 
Vertical hits were calculated as:

$$L_j = (10) \sum_{i=1}^{n} S_{ji}$$   [Eq 6]

where:

$L_j$ = hits at the jth level
j = 1 to 11
i = individual sampling points
$S_{ji}$ = 1 for presence
         = 0 for absence
n = number of sampling points (12).

Vegetation volume (VEGVOL) was calculated as:

$$VEGVOL = (10) \sum_{j=1}^{m} \sum_{i=1}^{n} S_{ji}$$   [Eq 7]

where:

m = number of vertical levels (11).

Horizontal heterogeneity (HZHW) was calculated as:

$$HZHW = (10)\,[X_q(\max) - X_q(\min)]\,(SD_W / \bar{X}_W)$$   [Eq 8]

where:

W = within quadrat
$X_q$ = VEGVOL in transect q
SD = standard deviation
$\bar{X}_W = (1/4) \sum_{q=1}^{4} X_q$.

Vertical heterogeneity (VHW) was calculated as:

$$VHW = (10) \sum_{j} 1/P_j^{2}$$   [Eq 9]

where:

$P_j$ = proportion of hits in the jth layer
$P_j = L_j / VEGVOL$.

Ground cover (GC) was calculated as:

$$GC = (5) \sum_{i=1}^{20} SG_i$$   [Eq 10]

Canopy cover (CC) was calculated as:

$$CC = (5) \sum_{i=1}^{20} SC_i$$   [Eq 11]

Level ground (TOP) was calculated as:

$$TOP = (5) \sum_{i=1}^{20} ST_i$$   [Eq 12]

where:

$ST_i$ = 1 for hit on level ground
$ST_i$ = 0 for hit on slope.

Aquatic habitat (AQUA) was calculated as:

$$AQUA = (5) \sum_{i=1}^{20} SA_i$$   [Eq 13]

where:

$SA_i$ = 1 for hit on aquatic site
$SA_i$ = 0 for hit on terrestrial site.
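
The presence/absence variables of Eqs 6, 7, and 10 through 13 are scaled counts. The following sketch shows the arithmetic for a single quadrat; the function names and the list-of-lists layout are assumptions for illustration only.

```python
def vertical_hits(presence):
    """Eq 6 sketch: presence[j][i] is 1 if vegetation met the measuring rod
    at vertical level j and sampling point i, otherwise 0."""
    return [10 * sum(level) for level in presence]


def vegvol(presence):
    """Eq 7 sketch: vegetation volume is the sum of hits over all levels."""
    return sum(vertical_hits(presence))


def percent_from_20_points(hits):
    """Eqs 10-13 sketch: 20 sighting points, so each hit counts for 5 percent."""
    return 5 * sum(hits)
```

With 20 sampling points, the multiplier 5 converts a count of positive hits directly to a percentage.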


Shrub  density  and  heterogeneity  (Figure  3)  were  estimated  by  walking 
along  each  of  the  four  perpendicular  transects  with  the  arms  fully  extended 
perpendicular  to  the  body.  The  total  number  of  vertical  woody  stems  <  3  cm 
diameter  breast  height  (dbh)  transected  by  the  arms  was  recorded  for  each  of 
the  four  transects.  The  intersections  were  at  a  height  of  1.5  m.  The  closest 
shrub  (woody  vegetation  with  stem  diameter  <  3  cm  and  height  >  1  m)  in  each 
quarter  from  locus  C  (Figure  3)  was  identified;  its  mean  crown  diameter  at 
maximum  horizontal  projection  and  its  distance  from  C  (<  13  m)  were  recorded. 


Shrub density (SHRD) was calculated as:

$$SHRD = (1000/92.56) \sum_{w=1}^{4} S_w$$   [Eq 14]

where:

$S_w$ = number of woody stems intersected in transect w.


Shrub heterogeneity (SHRH) was calculated as:

$$SHRH = (10)\,[S_w(\max) - S_w(\min)]\,(SD_W / \bar{S}_W)$$   [Eq 15]

where:

$\bar{S}_W = (1/4) \sum_{w=1}^{4} S_w$
W = within quadrat
SD = standard deviation.
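
Eqs 14 and 15 both operate on the four stem counts from one quadrat. This sketch assumes the sample standard deviation and returns 0 when no stems were intersected; neither choice is stated in the text, and the function names are hypothetical.

```python
from statistics import mean, stdev


def shrd(stems):
    """Eq 14 sketch: shrub density from stem counts on the four transects."""
    return (1000 / 92.56) * sum(stems)


def shrh(stems):
    """Eq 15 sketch: range of counts scaled by their coefficient of variation.

    Assumption: a quadrat with no stems is assigned a heterogeneity of 0
    to avoid dividing by a zero mean.
    """
    s_bar = mean(stems)
    if s_bar == 0:
        return 0.0
    return 10 * (max(stems) - min(stems)) * (stdev(stems) / s_bar)
```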
Shrub cover (SHRC) was calculated as:

$$SHRC = \pi\,(\bar{y}_W/2)^{2}\,(SHRD)$$   [Eq 16]

where:

$\bar{y}_W$ = mean shrub diameter
            $= (1/16) \sum_{k=1}^{16} y_k$.

Within the 26-m circular quadrat, the height of the dominant or most
abundant vegetation layer (CANAV) as well as the highest vertical projection
of vegetation (CANMX) were measured with the 7.5-m telescopic measuring rod or
an accurately calibrated TOPCON clinometer/range finder.

Each tree whose bole center fell within the 26-m circular quadrat (dbh >
3 cm) was identified and placed into one of 10 dbh categories (see Appendix
E). The basal area categories were measured on a dbh-calibrated meter stick,
but with experience, trees could be classified accurately from distances of up
to 13 m.


Total basal area of trees (TREMAS) was calculated as:

$$TREMAS = (\pi/10^{4}) \sum_{b=1}^{10} N_b\,(d_b/2)^{2}$$   [Eq 17]

where:

$N_b$ = total number of trees of size category b
$d_b$ = mean dbh of size category b.

Basal area of snags (DDEA) was calculated as:

$$DDEA = (\pi/10^{4}) \sum_{b=2}^{10} N_{bs}\,(d_b/2)^{2}$$   [Eq 18]

where:

$N_{bs}$ = total number of snags of size category b.
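
Eqs 17 and 18 sum circular cross-sections over dbh categories. The sketch below assumes dbh is given in centimeters, so that the factor 10^4 converts cm2 to m2; the text does not state the units explicitly, and the function name is hypothetical.

```python
import math


def basal_area(counts, mean_dbh_cm):
    """Eqs 17-18 sketch: total basal area over dbh size categories.

    counts[b]      = number of trees (or snags) in size category b
    mean_dbh_cm[b] = mean dbh of category b, assumed to be in cm
    """
    return (math.pi / 1e4) * sum(
        n * (d / 2) ** 2 for n, d in zip(counts, mean_dbh_cm)
    )
```

For snags (Eq 18), the same function would be called with the counts and diameters for categories 2 through 10 only.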


The basal area of a specific tree species or tree guild was calculated in
the same manner as TREMAS, except that $N_b$ was equal to the number of trees
of species (or guild) x of size category b. Appendix E lists all 44 Illinois
habitat variables. Appendix F provides a complete list of all tree species
and guild memberships, and scientific names. All variables were transformed
as $V_x' = \log_e (V_x + 1)$.
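
The log transformation applied to the Illinois variables can be sketched in one line. Using `math.log1p` is an implementation choice made here for numerical accuracy near zero, not something the report specifies.

```python
import math


def log_transform(values):
    """Sketch of the transformation log_e(V + 1) applied to each value."""
    return [math.log1p(v) for v in values]
```

Adding 1 before taking the logarithm keeps zero counts (for example, quadrats with no trees of a given guild) at zero after transformation.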


Illinois  habitat  variables  were  grouped  into  four  subsets  for  analysis. 


1.  The  "structural"  subset  consists  of  variables  defining  the 
presence/absence  or  distribution  of  vegetation  in  a  three-dimensional  grid 
superimposed  on  each  quadrat,  including  the  aquatic  and  topographic  variables. 

2. The "typical" subset consists of variables generally seen in the
literature and that researchers use consistently to quantify bird/habitat
relationships: ground and canopy covers, measures of shrub abundance and
patchiness, canopy heights, species richness of shrubs and trees, and the
densities of various size classes of trees.

3. The "typical + trees" subset is subset number 2 (discussed above)
with the addition of variables quantifying basal areas of tree species or
guilds (i.e., the addition of floristics components).

4. The subset "all" represents the use of all variables.


Data Analysis

The  SAS  (87)  statistical  package  and  the  IBM  mainframe  computer  were  used 
for  all  multivariate  analyses.  Principal  components  solutions  were  obtained 
from  Pearson  product-moment  correlation  matrices,  and  were  varimax  rotated. 
Crucial  to  the  principal  components  analysis  was  the  extraction  of  only  four 
principal  components  (PCs).  Chapter  5  indicates  that  PC  1  is  extremely 
important;  not  only  is  its  ecological  relevance  critical,  but  its  respective 
eigenvalue  explains  the  largest  portion  of  sample  variance.  Generally,  PC  2 
and/or  PC  3  may  also  be  important  and  interpretable,  describing  other 
ecological  features  unrelated  (or  indirectly  related)  to  the  major 
environmental  feature  defined  by  PC  1.  Principal  components  higher  than  the 
first  three  or  four  are  generally  not  interpretable,  and  cumulatively  explain 
a  very  low  portion  of  total  sample  variance.  This  is  particularly  true  with 
environmental  data  sets. 

The  possible  number  of  extractable  principal  components  is,  in  principle, 
equal  to  the  number  of  variables  in  the  data  set,  but  the  higher-order 
components  may  have  eigenvalues  very  close  to  zero.  Chapter  5  shows  that  the 
use  of  varimax  rotation  enhances  the  ecological  interpretation  of  PC  2,  PC  3, 
and  sometimes  PC  4.  However,  the  structure  of  the  rotated  factor  pattern 
matrix  depends  strongly  on  the  number  of  principal  components  transformed  by 
the  rotation.  If  principal  components  are  extracted  from  an  n  variable  data 
set,  the  n  x  n  factor  pattern  matrix  is  obtained.  Multiplying  this  matrix 
with  an  n  x  n  transformation  matrix  results  in  the  n  x  n  orthogonally  rotated 
factor  pattern  matrix.  Using  the  above  routine,  the  loadings  on  the  first 
four  rotated  principal  components  are  different  from  those  obtained  if  the  n  x 
4  factor  pattern  matrix  is  used.  (The  n  x  4  matrix  uses  only  the  first  four 
principal  components.)  The  reason  for  the  discrepancy  can  be  attributed  to 
the  structure  of  the  transformation  matrix: 


$$B = AT$$   [Eq 19]

where:

A = the n x $\ell$ factor pattern matrix

T = the $\ell$ x $\ell$ transformation matrix

B = the n x $\ell$ rotated factor pattern matrix

n = number of variables

$\ell$ = number of principal components.

As the number of principal components that go into the rotated solution
increases, the size of the $\ell$ x $\ell$ transformation matrix increases.
This matrix is constructed from systematic, successive orthogonal rotations
(transformations) such that each axis is rotated with every other axis only
once. Therefore, the final complete orthogonal rotation is actually a product
of many simpler orthogonal rotations. Because of orthogonality, the elements
of T are subjected to the following important constraint:

$$\sum_{k=1}^{\ell} \lambda_{ki} \lambda_{kj} = \delta_{ij} \qquad (i, j = 1, 2, \ldots, \ell;\ i \le j)$$   [Eq 20]

where:

$\delta_{ij}$ = 1 when i = j
              = 0 when i ≠ j.

The number of such conditions is $\ell(\ell + 1)/2$. Since there are $\ell^{2}$
possible parameters in matrix T, there remain $\ell(\ell - 1)/2$ degrees of
freedom. The construction of T by successive orthogonal transformations with
the above constraint inherently reduces factor loadings, obscuring the
relationships between the rotated solution and the original variables. When
$\ell$ is small, the diagonal elements of the transformation matrix have high
loadings (e.g., 0.6, 0.7, 0.8, etc.); however, as $\ell$ increases, diagonal
elements have low and inconsistent values. In other words, with increasing
$\ell$ (number of principal components), the interpretation of the rotated
solution with respect to the original variables becomes increasingly confusing.
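
The counting argument above can be checked numerically. The sketch below builds an orthogonal matrix as a product of plane rotations, one for each pair of axes, and confirms that the number of rotations is l(l - 1)/2 and that the product satisfies the constraint of Eq 20. It uses a single fixed angle for every pair purely for illustration; an actual varimax rotation chooses each angle to optimize a criterion, which is not reproduced here. All function names are hypothetical.

```python
import math
from itertools import combinations


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def matmul(a, b):
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def plane_rotation(n, i, j, angle):
    """Rotation by `angle` in the plane of axes i and j; other axes unchanged."""
    r = identity(n)
    c, s = math.cos(angle), math.sin(angle)
    r[i][i], r[j][j] = c, c
    r[i][j], r[j][i] = s, -s
    return r


def successive_rotation(n, angle):
    """Rotate every axis with every other axis exactly once."""
    t = identity(n)
    count = 0
    for i, j in combinations(range(n), 2):
        t = matmul(t, plane_rotation(n, i, j, angle))
        count += 1
    return t, count
```

Each plane rotation contributes one free angle, which is where the l(l - 1)/2 degrees of freedom come from.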


A  common  error  is  to  obtain  all  possible  principal  components  from  a  data 
set,  rotate  their  solution,  and  subsequently  interpret  the  principal  compo¬ 
nents  analysis  by  retaining  the  first  few  principal  components.  For  this 
study,  the  n  x  4  factor  pattern  matrix  has  been  used  to  obtain  varimax  rotated 
solutions;  this  has  optimized  the  ecological  relevance  and  interpretation  of 
principal  components  analysis.  Similarly,  canonical  analysis  of  discriminance 
was  performed  using  only  the  first  four  canonical  variates. 


The clustering algorithm used was the hierarchical agglomerative
unweighted pair-group method using arithmetic averages (UPGMA), which is also
called average linkage clustering. This method was developed by Sokal and
Michener (95). Squared Euclidean distance was used to construct the
resemblance matrix. Habitat variables were transformed (Appendix C) and then
standardized to Z scores (the means are 0 and the variances are 1) before
analysis. Principal component and canonical variate scores were not
transformed or standardized. This methodology was found to be robust and
effective with a wide diversity of databases and consistently superior to other
approaches. The dendrograms produced by unweighted clustering have higher
cophenetic correlations with resemblance matrices than dendrograms produced by
weighted clustering (93). Sneath and Sokal (94), Romesburg (84), and Seber
(90) also strongly recommend the UPGMA. A dendrogram scale is not shown in
any cluster analysis for this study, since only the relative relationships of
the nodes are of primary concern.
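
A minimal sketch of average linkage (UPGMA) clustering on squared Euclidean distances follows. It is a plain-Python illustration, not the SAS procedure used in the study; the function names are hypothetical, the input rows are assumed to be already transformed and standardized, and ties are broken by taking the first closest pair found.

```python
from statistics import mean


def squared_euclidean(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def upgma(points):
    """Return the merge history as (members_a, members_b, distance) tuples.

    The distance between two clusters is the arithmetic average of all
    pairwise squared Euclidean distances between their members.
    """
    clusters = {i: [i] for i in range(len(points))}
    merges = []
    while len(clusters) > 1:
        keys = sorted(clusters)
        best = None
        for ai in range(len(keys)):
            for bi in range(ai + 1, len(keys)):
                a, b = keys[ai], keys[bi]
                d = mean(
                    squared_euclidean(points[i], points[j])
                    for i in clusters[a]
                    for j in clusters[b]
                )
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((sorted(clusters[a]), sorted(clusters[b]), d))
        clusters[a] = clusters[a] + clusters.pop(b)
    return merges
```

The merge order alone reproduces the relative relationships of the dendrogram nodes, which is the information the study reports.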
5 DATA ANALYSIS RESULTS


Theoretical  Foundation  for  Environmental  Gradients 


A series of m study sites, $H_h$, is identified in which the magnitude of h
reflects the relative degree of environmental impact. Thus, $H_1$ is a control
site, and $H_m$ is the most impacted. A suite of p habitat (environmental)
variables, $X_j$, is obtained with total sampling units $\sum_{h=1}^{m} N_h$.
A sampling unit is a transect or quadrat of any size or shape within which
$X_j$'s are measured. Values of $X_j$ are sample estimates of population
parameters: means, standard deviations, coefficients of variation, etc. The
$N_h$ x p data matrix for site $H_h$ is:

$$X_h = \begin{bmatrix} X_{11h} & X_{21h} & \cdots & X_{p1h} \\ X_{12h} & X_{22h} & \cdots & X_{p2h} \\ \vdots & \vdots & & \vdots \\ X_{1N_hh} & X_{2N_hh} & \cdots & X_{pN_hh} \end{bmatrix}$$   [Eq 21]

The maximum-likelihood estimate of mean habitat vector $\mu_h$ in the hth
site is the sample mean vector $\bar{x}_h$ of the hth site. Each habitat site
is represented in p-dimensional habitat space by the mean habitat vector
$\bar{x}_h$.

$$\bar{x}_h = \left[ \frac{1}{N_h} \sum_{i=1}^{N_h} X_{1ih},\ \frac{1}{N_h} \sum_{i=1}^{N_h} X_{2ih},\ \ldots,\ \frac{1}{N_h} \sum_{i=1}^{N_h} X_{pih} \right]$$   [Eq 22]


By  the  definition  used  here,  the  major  difference  between  the  study  sites 
is  attributable  to  environmental  impacts.  (There  are  generally  other 
differences  as  well.)  Therefore,  the  loci  of  study  sites  in  p-dimensional 
habitat  space  will  be  ellipsoidal,  with  the  major  axis  corresponding  to  the 
habitat  variable  closely  associated  with  characterizing  the  environmental 
impact. This would be the simple case if all habitat variables were independent.
However, many environmental variables describing habitat structure are
highly correlated. Although statistically a problem, this is an advantage for
the ecologist, since more than one measure is available for quantifying
habitat  attributes  such  as  disturbance.  The  realistic  problem  becomes  to 
identify  which  habitat  variables,  and  especially  their  relative  contributions, 
express  the  greatest  variance  in  the  p-dimensional  dispersion.  Solving  this 
problem  will  provide  the  analytical  tool  to  quantify  ecological  attributes 
along  this  environmental  impact  gradient. 


A necessary background is establishing the equivalence of coordinate
system rotations and linear transformations. This is conveniently
demonstrated for the two-dimensional case but, of course, has no graphical
analogy when the dimensions exceed three. Examining the axis rotation in
Figure 4, where the original axes are $A_1$ and $A_2$ and a rotation of
$\theta$ degrees produces new axes $B_1$ and $B_2$ ($\theta_{\alpha\beta}$
defines the angle between the original axis $\alpha$ and a rotated axis
$\beta$):

$$\theta = \theta_{11} = \theta_{22} = \theta_c = \theta_d \quad \text{(from plane geometry)}$$

$$\cos \theta_c = b_1/(a_1 + z_1)$$

$$\tan \theta_{22} = z_1/a_2 \qquad z_1 = a_2 \sin\theta / \cos\theta$$

$$\therefore\ b_1 = (\cos\theta)\,a_1 + (\sin\theta)\,a_2$$   [Eq 23a]

$$\cos \theta_d = b_2/z_2$$

$$\tan \theta_{11} = (a_2 - z_2)/a_1 \qquad z_2 = a_2 - a_1 \sin\theta / \cos\theta$$

$$\therefore\ b_2 = (-\sin\theta)\,a_1 + (\cos\theta)\,a_2$$   [Eq 23b]

The original coordinates of Z were $(a_1, a_2)$. The new coordinates of Z
under rotation $\theta$ are $(b_1, b_2)$ or, in terms of the original
coordinates:

$$[(a_1 \cos\theta + a_2 \sin\theta),\ (-a_1 \sin\theta + a_2 \cos\theta)]$$

Two  important  factors  should  be  noted: 

1. Development of the new coordinates produced equations where new
variables were defined that are linear combinations of the original variables;
mathematically, this is a linear transformation.

2. If the locus Z represented the apex of mean vector $\bar{x}$, neither the
location of $\bar{x}$, or more importantly, the dispersion of the raw data
$(X_1, X_2, \ldots, X_p)$ ($a_1$, $a_2$ in the two-dimensional case) around
$\bar{x}$, or the loci of $X_j$'s relative to one another are affected by the
axis rotation (linear transformation).

$$\cos^{2}\theta + \sin^{2}\theta = 1 \quad \text{(from trigonometric identities)}$$
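
The second point can be verified numerically: rotating the axes changes the coordinates of each point but not the distances between points. A minimal sketch of Eqs 23a and 23b follows; the function name is hypothetical.

```python
import math


def rotate_axes(point, theta):
    """Coordinates of `point` on axes rotated by theta radians (Eqs 23a, 23b)."""
    a1, a2 = point
    b1 = math.cos(theta) * a1 + math.sin(theta) * a2
    b2 = -math.sin(theta) * a1 + math.cos(theta) * a2
    return (b1, b2)
```

Because the distance between any two rotated points equals the distance between the originals, the dispersion of the data around the mean vector is unchanged by the rotation.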


Therefore, in any linear combination, such as Eq 23, or more generally:

$$Y = n_1 X_1 + n_2 X_2$$   [Eq 24a]

$$n_1^{2} + n_2^{2} = 1$$   [Eq 24b]

there exists an axis rotated by $\theta$ degrees where $n_1 = \cos\theta$ and
$n_2 = \sin\theta$.

Also note from Eq 23 that:

$$(\cos\theta)(-\sin\theta) + (\sin\theta)(\cos\theta) = 0$$

or more generally, if:

$$Y_1 = n_{11} X_1 + n_{21} X_2$$   [Eq 25a]

$$Y_2 = n_{12} X_1 + n_{22} X_2$$   [Eq 25b]

and

$$n_{11} n_{12} + n_{21} n_{22} = 0$$   [Eq 25c]

the axes $Y_1$ and $Y_2$ are perpendicular to each other (i.e., orthogonal).

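To generalize these results to higher dimensions, it is expedient to
express Eqs 23a and 23b in the form of directional cosines (Figure 4).

$$\theta_{12} = \theta + 90^{\circ}$$
$$\theta_{21} = -(90^{\circ} - \theta)$$

$$\therefore\ \sin\theta = \sin(\theta_{21} + 90^{\circ}) = \cos\theta_{21}$$

$$-\sin\theta = -\sin(\theta_{12} - 90^{\circ}) = \cos\theta_{12}$$

Therefore, an equivalent form for Eqs 23a and 23b becomes:

$$b_1 = (\cos\theta_{11})\,a_1 + (\cos\theta_{21})\,a_2$$
$$b_2 = (\cos\theta_{12})\,a_1 + (\cos\theta_{22})\,a_2$$

or for p-dimensions:

$$[b_1, b_2, \ldots, b_p] = [a_1, a_2, \ldots, a_p] \begin{bmatrix} \cos\theta_{11} & \cos\theta_{12} & \cdots & \cos\theta_{1p} \\ \cos\theta_{21} & \cos\theta_{22} & \cdots & \cos\theta_{2p} \\ \vdots & \vdots & & \vdots \\ \cos\theta_{p1} & \cos\theta_{p2} & \cdots & \cos\theta_{pp} \end{bmatrix}$$   [Eq 26]

$\cos\theta_{\alpha\beta}$ can be written as $\ell_{\alpha\beta}$ for
convenience. $\ell_{\alpha\beta}$ is equivalent to: (1) the linear
transformation coefficient of element $\alpha\beta$ or (2) the cosine of the
rotation angle between the original axis $\alpha$ ($\alpha$ = 1,2,...,p) and
the rotated axis $\beta$ ($\beta$ = 1,2,...,p).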


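The equivalence of a linear transformation and rigid axis rotation is now
evident.

The condition on rigid orthogonal rotation can now be specified by
expanding Eqs 24 and 25 to higher dimensions.

$$\sum_{j=1}^{p} \ell_{j\alpha}\,\ell_{j\beta} = \begin{cases} 1 & \text{if } \alpha = \beta \\ 0 & \text{if } \alpha \neq \beta \end{cases} \qquad \alpha = 1, 2, \ldots, p;\ \ \beta = \alpha, \alpha + 1, \ldots, p$$   [Eq 27]

In matrix notation, Eq 26 can be written as a row vector:

$$b' = a'L$$   [Eq 28]

with L being the linear transformation matrix, or equivalently as a matrix:

$$B = AL$$   [Eq 29]

for an N x p matrix.

For any $b_{\beta i}$:

$$b_{\beta i} = \ell_{1\beta} a_{1i} + \ell_{2\beta} a_{2i} + \cdots + \ell_{p\beta} a_{pi}$$

$$\bar{b}_\beta = \frac{1}{N} \sum_{i=1}^{N} b_{\beta i} = \frac{1}{N} \sum_{i=1}^{N} (\ell_{1\beta} a_{1i} + \ell_{2\beta} a_{2i} + \cdots + \ell_{p\beta} a_{pi})$$

$$= \ell_{1\beta} \Big( \frac{1}{N} \sum_{i=1}^{N} a_{1i} \Big) + \ell_{2\beta} \Big( \frac{1}{N} \sum_{i=1}^{N} a_{2i} \Big) + \cdots + \ell_{p\beta} \Big( \frac{1}{N} \sum_{i=1}^{N} a_{pi} \Big)$$

$$= \ell_{1\beta} \bar{a}_1 + \ell_{2\beta} \bar{a}_2 + \cdots + \ell_{p\beta} \bar{a}_p$$

Therefore, in vector form:

$$\bar{b}' = \bar{a}'L$$   [Eq 30]

In matrix form:

$$\bar{B} = \bar{A}L$$   [Eq 31]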


The constraint on Eq 27 in matrix form is:

$$L'L = I$$   [Eq 32]

where I is the identity matrix.

Eq 32 defines the transformation matrix L as an orthogonal square matrix;
it is nonsingular since:

$$L'LL^{-1} = IL^{-1}$$
$$L'I = IL^{-1}$$

The problem of interest is to find the transformation matrix L that
transforms the data matrix X into Y such that a rotated axis ($Y_1$) coincides
with the major axis of the ellipsoid of raw data values
($X_{1i}, X_{2i}, \ldots, X_{pi}$), where [i = 1, 2, ..., $\sum_{h=1}^{m} N_h$].


The raw data matrix must be in the form of a dispersion matrix (e.g.,
sums of squares and cross-products), since there is interest in data variance
(see Appendix G). In the well known univariate case:

$$\sum x^{2} = \sum X^{2} - \Big( \sum X \Big)^{2} / N$$   [Eq 33]

where $\bar{X} = \sum X / N$ and $x = X - \bar{X}$.

$(\sum X)^{2}/N$ is generally seen in the form $N(\bar{X})^{2}$.

The matrix form of Eq 33 is analogous, and the (N x p) data matrix can be
represented over all habitat sites by:

$$A(X) = X'X - \bar{X}'\bar{X} = \begin{bmatrix} \sum x_1^{2} & \sum x_1 x_2 & \cdots & \sum x_1 x_p \\ \sum x_2 x_1 & \sum x_2^{2} & \cdots & \sum x_2 x_p \\ \vdots & \vdots & & \vdots \\ \sum x_p x_1 & \sum x_p x_2 & \cdots & \sum x_p^{2} \end{bmatrix}$$   [Eq 34]

Similarly,

$$A(Y) = Y'Y - \bar{Y}'\bar{Y}$$   [Eq 35]

Substituting Eqs 29 and 31 into Eq 35:

$$A(Y) = (XL)'(XL) - (\bar{X}L)'(\bar{X}L)$$
$$= L'X'(XL) - L'\bar{X}'(\bar{X}L)$$
$$= L'(X'X)L - L'(\bar{X}'\bar{X})L$$
$$= L'(X'X - \bar{X}'\bar{X})L$$

From Eq 34:

$$A(Y) = L'A(X)L$$   [Eq 36]

As discussed above, we are interested in the transformation:

$$Y_j = \ell_{1j} X_1 + \ell_{2j} X_2 + \cdots + \ell_{pj} X_p$$

The dispersion of Y is given by $\sum y^{2}$. From Eq 36:

$$\sum y^{2} = v'A(X)v$$   [Eq 37]

where v is the vector $[\ell_1, \ldots, \ell_p]'$ or p x 1 transformation
matrix.

The problem is to construct v such that $\sum y^{2}$ is maximized. Maximizing
$\sum y^{2}$ is meaningless without a stipulation on v. Otherwise
$\sum y^{2}$ would be $+\infty$. Since we only want to deal with rigid
orthogonal rotations, Eq 32 identifies the constraints: v'v = 1 for a vector.
Maximizing a function with an associated constraint is done using the Lagrange
multiplier ($\lambda$).

$$C(x,y,z) = F(x,y,z) - \lambda\,\phi(x,y,z)$$   [Eq 38]

F(x,y,z) can be maximized if:

$$\phi(x,y,z) = 0$$

and

$$\frac{\partial C}{\partial x} = 0, \qquad \frac{\partial C}{\partial y} = 0, \qquad \frac{\partial C}{\partial z} = 0.$$

In this case, $C(v) = F(v) - \lambda\,\phi(v)$ (see ref. 4).

$$\frac{\partial C}{\partial v} = 2A(X)v - 2\lambda I v$$

when $\partial C / \partial v = 0$:

$$[A(X) - \lambda I]\,v = 0 \qquad \lambda \text{ a scalar}$$   [Eq 39]
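
The step from Eq 38 to Eq 39 can be written out explicitly. The sketch below assumes F(v) = v'A(X)v from Eq 37 and takes the constraint function to be $\phi(v) = v'v - 1$; the report does not state $\phi$ in this form, so that choice is an assumption consistent with v'v = 1.

$$C(v) = v'A(X)v - \lambda\,(v'v - 1)$$

$$\frac{\partial C}{\partial v} = 2A(X)v - 2\lambda v = 0 \quad \Longrightarrow \quad A(X)v = \lambda v$$

$$v'A(X)v = \lambda\,v'v = \lambda$$

Under these assumptions, the constrained value of $\sum y^{2}$ at any stationary point equals the corresponding root $\lambda$, so the maximum is attained at the largest root.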


If [A(X) - $\lambda$I] is nonsingular, a possible solution to Eq 39 is trivial
(i.e., the null vector):

$$v = [A(X) - \lambda I]^{-1}[0]$$

Therefore, [A(X) - $\lambda$I] must be singular for a nontrivial solution;
mathematically, its determinant must vanish.

$$|A(X) - \lambda I| = 0$$   [Eq 40]

This is the well known "characteristic equation," with p characteristic
roots $\lambda_j$, or eigenvalues of the matrix A(X); p equals the rank of
A(X). Since A(X) is a symmetric matrix, all eigenvalues are real. Furthermore,
if A(X) is nonsingular, no eigenvalue equals zero. From Eq 39, each eigenvalue
$\lambda_j$ has associated with it an eigenvector $v_j$. (See Appendix A.)

Solving  Eqs  39  and  40  is  routine;  a  pertinent  discussion  is  available  in 
any  matrix  algebra  text,  such  as  ref.  89. 


Dividing Eq 37 by N (technically N-1 for an unbiased estimate), and
specifying $Y_j$:

$$\mathrm{var}\,Y_j = v_j'\,S(X)\,v_j$$

S(X) is a covariance matrix.

From Eq 39:

$$S(X)\,v_j = \lambda_j v_j$$

$$\mathrm{var}\,Y_j = v_j'\,\lambda_j\,v_j$$

since v'v = 1 and $\lambda_j$ is a scalar:

$$\mathrm{var}\,Y_j = \lambda_j$$   [Eq 41]


Eigenvalue $\lambda_j$ is identified as the variance of the jth transformed
variable, or equivalently, the jth linear combination of original data
variables ($X_1, X_2, \ldots, X_p$), where
$Y_j = \ell_{1j} X_1 + \ell_{2j} X_2 + \cdots + \ell_{pj} X_p$.

Since we maximized variable Y in Eq 37 using Eq 38, the first solution
variable var $Y_1 = \lambda_1$ is a maximum. $Y_1$ is therefore the desired
transformed variable, and equivalently, $Y_1$ is the major axis of the
ellipsoid of $\sum_{h=1}^{m} N_h$ points in p-dimensional space of habitat
variables. We can solve for any $Y_{1h}$. For each site $H_h$, a mean score is
calculated for $Y_1$ from the raw habitat variables $X_{jih}$.

$$\bar{Y}_{1h} = \frac{1}{N_h} \sum_{i=1}^{N_h} x_{ih}'\,v_1$$   [Eq 42]


$Y_1$ can be identified as the environmental impact gradient. Also:

$$\sum_{j=1}^{p} \lambda_j = \mathrm{tr}\,A(X)$$

(tr is the trace or sum of the diagonal elements of a matrix.) Since in a
covariance matrix the diagonal elements are variances:

$$\sum_{j=1}^{p} \lambda_j = \mathrm{tr}\,S(X) = \sum_{j=1}^{p} \mathrm{var}\,X_j$$   [Eq 43]

From Eqs 41 and 43:

$$\sum \mathrm{var}\,Y_j = \sum \lambda_j = \sum \mathrm{var}\,X_j$$   [Eq 44]

The sum of eigenvalues represents not only the total variance of the
transformed variables, but also the total variance in the original data when
the data is in the form of a covariance matrix. The proportion of variance
explained by the first transformed variable is $\lambda_1 / \sum \mathrm{var}\,X_j$.
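
The relationships in Eqs 41, 43, and 44 can be checked on a small data set. The sketch below estimates the largest eigenvalue and its eigenvector of a covariance matrix by power iteration, a standard numerical method swapped in here for illustration (the study itself used SAS). Function names are hypothetical, and the iteration assumes the covariance matrix is nonzero and the starting vector is not orthogonal to the first eigenvector.

```python
from statistics import mean


def covariance_matrix(rows):
    """Sample covariance matrix S(X) (N - 1 denominator) of an N x p data set."""
    n = len(rows)
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    p = len(cols)
    return [
        [
            sum((cols[a][i] - mus[a]) * (cols[b][i] - mus[b]) for i in range(n))
            / (n - 1)
            for b in range(p)
        ]
        for a in range(p)
    ]


def first_component(s, iterations=500):
    """Largest eigenvalue and unit eigenvector of symmetric matrix s."""
    p = len(s)
    v = [1.0 / p ** 0.5] * p
    for _ in range(iterations):
        w = [sum(s[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    # Rayleigh quotient v'Sv gives the eigenvalue because v'v = 1
    lam = sum(v[i] * sum(s[i][j] * v[j] for j in range(p)) for i in range(p))
    return lam, v


def proportion_explained(s):
    """Share of total variance carried by the first transformed variable."""
    lam, _ = first_component(s)
    return lam / sum(s[i][i] for i in range(len(s)))
```

For two perfectly correlated variables, all of the variance lies along a single axis, so the proportion explained by the first component is 1.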

A  reader  familiar  with  multivariate  methods  would  recognize  the  above 
results  as  a  principal  components  solution,  with  Y^  being  PC  1.  Actually,  p- 
orthogonal  principal  components  can  be  extracted  from  an  N  x  p  data  matrix, 
with  each  successive  principal  component  explaining  a  decreasing  proportion  of 
total  sample  variance. 

Any two eigenvectors, $v_\alpha$ and $v_\beta$ ($\alpha \neq \beta$), are
independent or orthogonal.

$$A(X)\,v_\alpha = \lambda_\alpha v_\alpha$$   [Eq 45]

$$A(X)\,v_\beta = \lambda_\beta v_\beta$$   [Eq 46]

A'(X) = A(X) (SSCP as well as covariance or correlation matrices are
symmetric.)

From Eq 45:

$$v_\alpha'\,A(X) = \lambda_\alpha v_\alpha'$$   [Eq 47]

Substituting Eq 47 into Eq 46:

$$\lambda_\alpha\,v_\alpha' v_\beta = \lambda_\beta\,v_\alpha' v_\beta$$

$$v_\alpha' v_\beta\,(\lambda_\alpha - \lambda_\beta) = 0$$

Since $\lambda_\alpha \neq \lambda_\beta$:

$$v_\alpha' v_\beta = 0$$


Since  X  *  X  : 
a  6 


$\Lambda$ is the covariance matrix of the transformed variables
$Y_1, Y_2, \ldots, Y_p$. This matrix shows that the variance of each
$Y_\alpha$ is $\lambda_\alpha$ and that the covariance or correlation between
$Y_\alpha$ and $Y_\beta$ is 0, $\alpha \neq \beta$ (i.e., they are
independent).

By their very nature, habitat variables represent the measurement of a
wide variety of environmental attributes. Obviously, variables will be
measured on different scales: dimensions (linear as well as exponential
functions), weights, degrees or radians, time, temperature, proportions or
percentage, counts, variances or coefficients of variation, dimensionless
indices, etc. These are all interval or ratio measures and are
desirable because of their statistical properties. However, it may be more
appropriate to quantify some (sometimes all) variables on an ordinal
ranking. An example would be the arithmetic rating scale of 1 to 10, or even
a geometric series, such as an exponential or log normal scale. Another
important scale in collecting ecological data is presence/absence, where data
values are either 1 or 0. Not only are all these scales vastly different, but
even with variables of the same measure, the relative magnitudes and/or
variances may vary considerably among environmental attributes. Generally,
the relative magnitudes of means and variances are positively correlated.

Following are some common examples of habitat variables of different scale:
shrub height or diameter (m), shrub cover (m²), shrub volume (m³), shrub
density (individuals/ha), substrate particle size (mm), distance from similar
habitats (km), primary production (g/m²), ground cover (%), slope aspect
(degrees), shrub heterogeneity (coefficient of variation), vertical vegetation
heterogeneity (a diversity index), relative soil moisture (1 to 5), and
presence/absence of shrub species (1 or 0).

Another  very  important  consideration  of  scaling  is  that  it  is  of  interest 
to  identify  high  relative  variance  components  that  are  closely  associated  with 
the  environmental  gradient,  rather  than  high  variance  per  se. 


A simple way to standardize all environmental variables, regardless of
scale or measure, is to construct standard scores for the variables.
Then, for each habitat variable, the mean is 0 and the variance is 1.
The standardized score $Z_{\alpha i}$ for the $i$th sample of variable $\alpha$ is given by:

$$Z_{\alpha i} = \frac{X_{\alpha i} - \bar{X}_\alpha}{s_\alpha} \qquad \text{[Eq 49]}$$

where $\bar{X}_\alpha$ and $s_\alpha$ are the mean and standard deviation of variable $\alpha$.

In other words, the new units of the habitat variables are really
deviations from their respective means per unit of standard deviation.
In terms of Z scores, the covariance matrix becomes the correlation
matrix, since the diagonal elements:

$$\operatorname{var} Z_\alpha = 1 \quad \text{for all } \alpha,$$

and the rest of the elements:

$$\operatorname{cov}(Z_\alpha, Z_\beta), \quad \alpha \neq \beta,$$

are equivalent to $r_{\alpha\beta}$, the correlation coefficient between $X_\alpha$ and $X_\beta$.
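The equivalence between the covariance matrix of standard scores and the correlation matrix of the original variables can be demonstrated directly. The sketch below assumes NumPy; the three variables (a height in meters, a density per hectare, and a presence/absence indicator) are hypothetical stand-ins for habitat variables on different scales.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 50

# Hypothetical habitat variables measured on very different scales.
height_m = rng.normal(1.2, 0.4, size=n)
density_per_ha = 800 + 300 * height_m + rng.normal(0, 100, size=n)
present = (rng.random(n) < 0.4).astype(float)      # presence/absence (1 or 0)
X = np.column_stack([height_m, density_per_ha, present])

# Standard scores: deviation from the mean per unit of standard deviation.
Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)

cov_Z = np.cov(Z, rowvar=False)      # covariance matrix of the Z scores
R = np.corrcoef(X, rowvar=False)     # correlation matrix of the raw variables
```

With this standardization every column of `Z` has mean 0 and variance 1, and `cov_Z` matches `R`, so an eigenanalysis of standardized variables is an eigenanalysis of the correlation matrix.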


The author feels that variable standardization justifies use of the
correlation matrix instead of the covariance matrix when dealing with habitat
variables, even though the covariance matrix has better statistical properties
(3, 44, 71, 39, 17). The two main problems with the correlation matrix are:
(1) distributional properties and sampling theory (including hypotheses
testing and calculation of confidence intervals) of transformed variables
extracted from correlation matrices are much more complex than those derived
from covariance matrices; and (2) maximizing the variance of standardized
scores intuitively appears ambiguous. A careful assessment of the arguments
does not preclude the necessity for variable standardization, especially when
primary interest is in the first transformed variable (e.g., PC 1). The
author's experience has been that when variable scales are heterogeneous, the
correlation matrix gives results that are ecologically more interpretable than
an eigenanalysis using a covariance matrix.


PCA can also be performed on the dispersion matrix (Appendix C). This is
known as noncentered PCA (74, 113, 11). Noncentered PCA has advantages in
classifying or describing compositional variation that includes
discontinuities, but is subject to strong distortion (113).

This discussion has centered on finding $Y_1$, which represents the
environmental impact gradient. Actually, an eigenanalysis (or principal
components analysis) extracts p transformed variables (principal components):
$Y_1, Y_2, \ldots, Y_p$, representing linear combinations of the original p
habitat variables. Some of these transformed variables may be ecologically
interpretable, and since they are orthogonal, they would represent ecological
gradients that are uncorrelated with habitat disturbance. The problem is that
the p extracted transformed variables, each containing unique linear
combinations of correlated habitat variables, are difficult to interpret
realistically. A similar analogy was seen with the original scatter of
$\sum_{h} N_h$ points in p-dimensional space. In a similar fashion, an
orthogonal rotation of axes can be envisioned that produces closer agreement
between the transformed variables (also called factors) and the original
variables. This is referred to as "simplifying factor structure." Thurstone
(103, 104, 105) first developed qualitative guidelines or principles for
parsimony, or simple structure; a lucid discussion can be found on pages 94
through 99 of ref. 43.


The  first  criteria  for  analytical  rotations  were  developed  by  Carroll 
(12);  an  extensive  review  is  provided  by  Harman  (43).  Ferguson's  (30) 
approach  to  mathematically  defining  parsimony  for  the  rotational  solution  can 
serve  as  a  simple  explanation  for  the  numerous  analytical  procedures  that  have 
been  developed,  since  most  of  the  solutions  are  closely  related.  Ferguson 
reasoned  that  the  simplest  description  of  a  point  in  space  upon  rotation  of 
orthogonal  axes  is  when  one  of  the  axes  passes  through  the  point.  As  one  of 
the  axes  is  rotated  toward  the  point,  the  product  of  the  two  coordinates 
becomes  smaller.  He  therefore  suggested  that  some  function  of  the  sum  of 
products  of  coordinates  be  used  as  a  parsimonious  description  of  these 
points.  Since  coordinates  take  on  negative  as  well  as  positive  values,  he 
proposed  the  sum  of  squares  of  coordinate  products  instead  of  the  simple 
sum.  This  is  exactly  analogous  to  the  familiar  calculation  of  sum  of  squares 
as  a  measure  of  dispersion  around  the  mean.  Analytically,  this  is: 


$$\sum_{j=1}^{p} \; \sum_{q<r=1}^{R} \left( a_{jq}\, a_{jr} \right)^2$$

with $R(R-1)/2$ sums of p pairs of coordinates. Parsimonious solutions to
functions  of  this  general  structure  reduce  to  simplifying  the  rows  or  columns 
of  the  factor  matrix  derived  from  the  eigenanalysis  or  principal  components 
solution.  The  factor  matrix  contains  the  "loadings"  or  the  Pearson  product- 
moment  correlation  coefficients  of  the  principal  components  (transformed 
variables)  with  the  original  variables.  PC  1  is  in  the  first  column,  PC  2  is 
in  the  second,  etc. 
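Ferguson's sum of squared coordinate products can be computed in a few lines. The sketch below assumes NumPy; the function names and the small factor matrix are hypothetical illustrations, not values from this study. It shows that the quantity is zero when every variable loads on a single factor and becomes positive once the axes are rotated away from that position.

```python
import numpy as np
from itertools import combinations

def ferguson_criterion(A):
    """Sum over variables j and factor pairs q < r of (a_jq * a_jr)**2."""
    A = np.asarray(A, dtype=float)
    n_factors = A.shape[1]
    return sum(np.sum((A[:, q] * A[:, r]) ** 2)
               for q, r in combinations(range(n_factors), 2))

def rotate2d(A, degrees):
    """Orthogonal rotation of a two-factor matrix by the given angle."""
    t = np.radians(degrees)
    T = np.array([[np.cos(t), -np.sin(t)],
                  [np.sin(t),  np.cos(t)]])
    return A @ T

# Hypothetical factor matrix in which each variable loads on one factor only.
simple = np.array([[0.9, 0.0],
                   [0.8, 0.0],
                   [0.0, 0.7],
                   [0.0, 0.6]])
mixed = rotate2d(simple, 30)   # the same points described by rotated axes
```

`ferguson_criterion(simple)` is zero because each point lies on an axis, whereas `ferguson_criterion(mixed)` is positive; analytical rotation criteria of this family search for the rotation that makes such a quantity as small as possible.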


The  quartimax  criterion  (72)  simplifies  the  rows  by  finding  orthogonal 
transformations  that  maximize  the  variance  in  the  distribution  of  squared 
factor  loadings.  Carroll  (12)  and  Saunders  (88)  proposed  similar  solutions 
for  row  simplification.  Simplifying  the  rows  of  a  factor  matrix  has  the 
effect  of  associating  the  variables  with  fewer  factors.  Thus,  parsimony  is 
achieved  by  producing  a  few  generalized  factors.  In  practice,  the  first 
principal  component  (often  PC  2  as  well)  has  many  variables  associated  with  it 
and  is  therefore  difficult  to  interpret  ecologically. 


The varimax criterion (59) is an effective orthogonal rotation that
enhances the ecological interpretation of principal components ≥ 2.
Essentially, the varimax method simplifies the columns of the factor matrix.
This has the effect of making the correlations in each column approach 1 or 0,
and therefore allows only a few variables to be strongly associated with a
given principal component. Principal components greater than four are not
usually interpretable because they explain such an insignificant portion of
sample variance.
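A varimax rotation can be sketched with the widely used iterative singular value decomposition scheme. This is an illustrative implementation assuming NumPy and the raw (unnormalized) varimax criterion; it is not the routine used for the analyses in this report, and the loading matrix is hypothetical.

```python
import numpy as np

def varimax(loadings, max_iter=500, tol=1e-8):
    """Orthogonally rotate a p x k loading matrix toward simple columns."""
    L = np.asarray(loadings, dtype=float)
    p, k = L.shape
    T = np.eye(k)                       # accumulated orthogonal rotation
    d_old = 0.0
    for _ in range(max_iter):
        Lr = L @ T
        # Gradient-like matrix of the raw varimax criterion.
        G = L.T @ (Lr ** 3 - Lr @ np.diag((Lr ** 2).sum(axis=0)) / p)
        u, s, vt = np.linalg.svd(G)
        T = u @ vt                      # nearest orthogonal matrix to G
        d_new = s.sum()
        if d_new < d_old * (1 + tol):   # stop when improvement stalls
            break
        d_old = d_new
    return L @ T, T

def varimax_criterion(A):
    """Sum over columns of the variance of the squared loadings."""
    return (np.asarray(A, dtype=float) ** 2).var(axis=0).sum()

# Hypothetical loadings: a simple structure viewed from axes turned by 30 degrees.
simple = np.array([[0.9, 0.0], [0.8, 0.0], [0.7, 0.0],
                   [0.0, 0.9], [0.0, 0.8], [0.0, 0.7]])
t = np.radians(30)
mixed = simple @ np.array([[np.cos(t), -np.sin(t)],
                           [np.sin(t),  np.cos(t)]])

rotated, T = varimax(mixed)
```

The rotation leaves each variable's communality (row sum of squared loadings) unchanged while raising the column-wise variance of the squared loadings, which is what pushes the correlations in each column toward 1 or 0.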

In summary, the first orthogonal rotation, identified as $Y_1$, represents the
linear combination of original habitat variables that explains the greatest
component of sample variance. Subsequent orthogonal rotations (under the
varimax criterion) simplify the columns of the factor matrix, and therefore
usually enhance the ecological interpretation of $Y_2$, $Y_3$, and sometimes $Y_4$.

Habitat  Variables 

Mojave  Desert  Sites  (Low  Ratio  of  Number  of 
Sampling  Units  to  Number  of  Variables) 

Canonical Analysis of Discriminance. Figures 5 through 10 show the
relative  discriminating  ability  of  habitat-variable  subsets  to  quantitatively 
classify  the  five  study  sites  of  the  central  Mojave  Desert.  Tables  1  through 
7  give  the  Pearson  correlation  coefficients  and  their  significance  for  habitat 
variables  with  the  first  four  canonical  variates.  These  data  represent  seven 
logical  assortments  of  habitat-variable  subsets.  In  reality,  more  than  50 
variable  subsets  were  analyzed.  Ordination  of  study  sites  in  the  canonical 
space  of  habitat  variables  depended  strongly  on  the  specific  choice  of 
variables.  The  complete  variable  set  could  not  be  analyzed,  since  the  number 
of  variables  for  canonical  analysis  is  limited  to  the  number  of  sampling 
units.  Figure  5  and  Table  1  show  the  analysis  using  the  18  variables  from  the 
original  data  set  that  passed  the  tolerance  test.  (This  ensures  that  the  rows 
and  columns  of  the  data  matrix  are  linearly  independent;  therefore,  the  data 
matrix is nonsingular and has an inverse, a prerequisite for canonical
analysis.) CV 1 separates study sites on the basis of environmental
characteristics of valleys contrasted to alluvials/high desert, and CV 2 separates these
sites  on  the  basis  of  habitat  disturbance.  The  gradient  along  CV  2  is  not  very 
clear  (Table  1).  Note  the  location  of  site  M  in  Figure  5. 
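The dependence of canonical analysis on a nonsingular matrix can be made concrete with a small sketch. The code below assumes NumPy and uses one common formulation of canonical variates (the eigenanalysis of the product of the inverse within-group SSCP matrix and the between-group SSCP matrix); the rank check is only a stand-in for the tolerance test mentioned above, and the function name, data, and group sizes are hypothetical.

```python
import numpy as np

def canonical_variates(X, groups):
    """Eigenanalysis of W^-1 B for grouped samples (illustrative only)."""
    X = np.asarray(X, dtype=float)
    groups = np.asarray(groups)
    p = X.shape[1]
    grand_mean = X.mean(axis=0)
    W = np.zeros((p, p))                 # pooled within-group SSCP matrix
    B = np.zeros((p, p))                 # between-group SSCP matrix
    for g in np.unique(groups):
        Xg = X[groups == g]
        dev = Xg - Xg.mean(axis=0)
        W += dev.T @ dev
        d = (Xg.mean(axis=0) - grand_mean)[:, None]
        B += len(Xg) * (d @ d.T)
    if np.linalg.matrix_rank(W) < p:
        raise ValueError("within-group matrix is singular; use fewer variables")
    evals, evecs = np.linalg.eig(np.linalg.solve(W, B))
    evals, evecs = evals.real, evecs.real
    order = np.argsort(evals)[::-1]      # CV 1 first
    return evals[order], evecs[:, order]

rng = np.random.default_rng(3)

# Hypothetical example: 3 sites, 10 transects each, 4 habitat variables.
groups = np.repeat([0, 1, 2], 10)
offsets = np.array([[0.0, 0.0, 0.0, 0.0],
                    [2.0, 0.0, 1.0, 0.0],
                    [0.0, 2.0, 0.0, 1.0]])
X = rng.normal(size=(30, 4)) + offsets[groups]

evals, evecs = canonical_variates(X, groups)
cv_scores = (X - X.mean(axis=0)) @ evecs[:, :2]   # scores on CV 1 and CV 2
```

With three groups only two canonical variates carry discriminating information, so the remaining eigenvalues are effectively zero. If the number of variables is large relative to the number of sampling units, the within-group matrix cannot be inverted and the function raises an error, which mirrors the restriction on variable subsets described above.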

The  analysis  using  only  10  "basic"  habitat  variables  permits  a  clearer 
ecological  interpretation  (see  Figure  6  and  Table  2).  CV  1  separates  study 
sites  by  contrasting  dense  shrub  and/or  forb  cover  with  sandy  substrates--an 
environmental impact gradient. CV 2 separates the valley control from the
other  sites  on  the  basis  of  the  valley  control's  high  burroweed  and  grass 
cover  and  its  coarse  sand  substrates. 

The use of shrub height classes along with the "basic" variables (Figure
7 and Table 3) clearly separates the impacted sites from the controls on CV
2. CV 1 effectively separates alluvials and valleys, contrasting gravel
substrates and dense low shrub cover with grass and coarse sand substrates,
respectively. However, the desirable feature of Figure 6, in which CV 1
represents the environmental impact gradient, is lost.

Figure  8  and  Table  4  represent  the  "basic"  variables,  with  two  variables 
added  to  quantify  shrub  size  heterogeneity  (variability  in  the  diameter  of 
creosote bush and burroweed). These two species usually account for more than
90 percent of the shrub cover in the central Mojave (64). Variability in the
heights of these species correlates strongly with variability in diameter
(Pearson, r = 0.86 for creosote bush, and 0.80 for burroweed, P < 0.001), so
only one of these measures is needed. Figure 8 amplifies the interpretation
of Figure 6 by adding creosote bush heterogeneity (crushed shrubs) to
sandy substrates as characteristics of disturbed Mojave Desert habitats. CV 2
contrasts grass and coarse sand substrates with gravel substrates and
burroweed size heterogeneity. Therefore, CV 2 contrasts environmental
characteristics of valleys with those of alluvials/high desert, respectively. Note the
locations  of  S  and  M  on  Figure  8,  and  note  that  CV  1  portrays  an  environmental 
impact  gradient.  The  addition  of  variable  CL4  (cover  of  shrubs  >  1.5  m  in 
height)  had  no  effect  on  the  canonical  structure  of  Figure  8  (see  Table  5). 
Shrubs  of  this  height  are  important  characteristics  of  unimpacted  desert 
valleys  (64). 

The  addition  of  variables  for  the  mean  heights  of  creosote  bush, 
burroweed,  and  subdominant  shrub  species  (even  though  these  are  highly 
correlated  with  their  respective  cover  variables)  helped  separate  the  valley 
control from the other sites (see Figure 9 and Table 6). Note that forb and
subdominant  shrub  cover  and  gravel  substrate  have  been  removed  from  CV  1. 
Since  these  variables  characterize  alluvials,  they  not  only  enhanced  the 
separation  of  alluvial  and  valley  controls  on  CV  1,  but  sharpened  the 
alluvial-valley  gradient  on  CV  2.  Since  shrubs  more  than  1  m  tall  character¬ 
ized  unimpacted  habitats,  while  shrubs  less  than  1  m  tall  were  typical  of 
disturbed Mojave habitats (64), the variable L12J (density of shrubs 1.1 to
< 1.5 m tall) was selected as another potential discriminating variable. The
resulting variable subset discriminated best among the study sites; CV 1
clearly defines the environmental
impact gradient (see Figure 10 and Table 7). Note that the alluvial control
and  the  high  desert  have  similar  habitat  structure. 

Eigenanalysis. Figure 11 shows the varimax rotated principal components
solution that used the habitat variable subset (Table 7) which provided the
best  ecological  interpretation  in  canonical  space.  Table  8  gives  the  Pearson 
correlation  coefficients  of  each  habitat  variable  with  the  first  four  princi¬ 
pal  components  (eigenvectors).  Clearly,  this  ordination  is  inadequate. 

Figure  12  and  Table  9  show  the  ordination  using  the  complete  set  of 
habitat  variables.  Although  the  ordination  presents  a  better  classification 
than  the  one  shown  in  Figure  11,  the  ecological  significance  of  some  of  the 
variables  is  obscure  (e.g.,  the  presence  of  many  of  the  shrub  species  in  only 
some  vegetation  transects). 

By screening more than 30 variable subsets and sequentially adding
variables of known importance (64), an optimal set of habitat variables was
delimited. Principal components analysis was not as sensitive as CAD to
variable selection. Also, a larger number of habitat variables could be
included in variable subsets, since eigenanalysis solutions do not depend as
critically on linear independence of matrix rows and columns as does canonical analysis.

Figure 13 and Table 10 show the optimal ordination. PC 1 represents the
environmental impact gradient--dense cover of creosote bush and burroweed,
tall shrubs, litter accumulation, and coarse sand substrates--contrasted to
high variability of creosote bush size (crushed shrubs) and sandy
substrates. PC 2 is positively associated with environmental characteristics of
alluvials and the high desert--cover of subdominant shrub species, high
variability of burroweed size (large burroweed along washes), short creosote
bush, and gravelly and rocky substrates. PC 3 contrasts grass and density
heterogeneity of creosote bush with forbs, shorter creosote bush, and density
heterogeneity of all shrubs other than creosote bush. These environmental
characteristics helped separate the impacted alluvial from its control.

Illinois  Successional  Sites  (High  Ratio  of  Number 
of  Sampling  Units  to  Number  of  Variables) 

Canonical  Analysis  of  Discriminance.  The  Illinois  data  set  was  organized 
into  four  subsets  of  habitat  variables:  structural,  typical,  typical  +  tree 
guilds,  and  all  (see  Chapter  4  and  Appendix  E  for  more  details).  Figures  14 
through  17  show  the  mean  canonical  variate  scores  (centroids)  of  the  10  study 
sites  on  the  first  four  canonical  variates.  Tables  11  through  14  give  the 
Pearson  correlation  coefficients  and  their  significance  for  each  habitat 
variable  with  the  first  four  canonical  variates.  Although  there  are  differ¬ 
ences  among  these  four  analyses,  unlike  CAD  with  the  Mojave  Desert  variables, 
the  general  pattern  is  similar,  regardless  of  which  variable  subset  is  used; 
CV  1  consistently  defines  the  major  environmental  gradient  (a  succession 
gradient  in  this  case).  This  "stability"  is  directly  attributable  to  the  high 
ratio  of  sample  sizes  to  the  number  of  variables,  and  was  maintained  even 
though  the  data  "quality"  was  lower  than  that  of  the  Mojave  Desert  example. 
(For  example,  in  the  Mojave  Desert,  each  habitat  parameter  was  estimated  from 
larger  sample  sizes,  and  the  samples  were  collected  over  a  much  larger  area; 
see  Chapters  1  and  4  for  more  details.) 

The variables describing the habitat as the presence/absence and
distribution of vegetation in a three-dimensional grid (Figure 14) gave results
very similar to those of the analysis defining the habitat in terms of ground
cover, shrub characteristics, and the density/distribution of tree sizes
(Figure 15). Note that the ecological interpretations of CV 2 and CV 3 are
reversed between these two analyses (Tables 11 and 12). This is very common in
discriminant analyses. In both analyses, CV 4 does not have much discriminating power.

The  addition  of  habitat  variables  identifying  tree  species  in  terms  of 
their basal areas improves discriminating ability, effectively separating
bottomland (G,H) from upland (I,J) forests, not only along CV 1 but also along CV
2  (see  Figure  16  and  Table  13).  CV  2  also  separates  the  older  shrub  habitats 
(D,E,F)  from  early  succession  stands  (A,B,C)  on  the  basis  of  the  densities  of 
medium-sized  trees,  including  cottonwood  and  snags.  CV  3  separates  sites  on 
the  basis  of  small  tree  (dbh  <  15  cm)  density,  particularly  black  locust. 

The  analysis  employing  all  variables  (see  Figure  17  and  Table  14)  gives 
results  comparable  to  those  of  the  preceding  analysis.  CV  2  separates  upland 
and  bottomland  forests,  and  CV  3  rather  accurately  ranks  sites  on  a  gradient 
from  open  habitats  to  increasing  densities  of  shrubs  and  small  trees  (similar 
to the preceding analysis). CV 4 portrays complex relationships in subcanopy
vegetation  structure. 

Eigenanalysis. Figures 18 through 21 show the mean principal component
scores of the 10 study sites on the first four principal components
(eigenvectors). Tables 15 through 18 give the Pearson correlation coefficients and
their significance for each habitat variable with the first four principal
components, and summarize the association of important habitat variables with
each of the principal components. The general pattern of the four analyses is
similar. Groups of variables representing similar habitat components were
associated  consistently.  PC  1  represents  the  main  environmental  gradient — a 
successional  gradient  in  which  early  seres  (A,B,C),  shrublands  (D,E,F)  and 
forest  (G,H,I,J)  are  separated. 

Two  study  sites — C  and  F — merit  additional  discussion.  Site  F  is  a  56- 
year-old  abandoned  strip  mine.  Unreclaimed  strip  mines  are  characterized  by 
an extensive ridge and valley topography. The ridge-and-valley relief of
unreclaimed  mines  of  this  age  is  very  low,  since  the  ridges  have  eroded,  and 
the  valleys  and  depressions  have  accumulated  silt.  The  troughs  contain 
permanent  water  with  extremely  deep  silt  deposits  (>  15  m).  Local  hydrology 
has  helped  maintain  a  high  water  table.  These  environmental  conditions  have 
not  only  limited  tree  density,  but  dramatically  decreased  the  growth  rates  of 
volunteer  trees  on  the  ridges.  A  study  of  growth  rings  from  tree  borings 
verified  extremely  low  growth  rates  over  the  last  20  years.  Site  F  consisted 
of  a  mosaic  of  open  areas  and  very  dense  shrub  cover  interspersed  with  medium 
to large trees and an unusually high representation of woody vines. Therefore,
because  of  its  low  tree  density  and  the  presence  of  open  habitats,  this  site 
ranked  below  sites  D  and  E  on  the  succession  axis  in  three  of  the  four 
analyses.  The  analysis  that  uses  all  habitat  variables  ranked  it  correctly  in 
an  age  sequence.  Note  that  Site  F  is  located  near  the  origin  (0,0)  in  PC  1/PC 
2  space  that  incorporates  all  variables  (Figure  21). 

Site  C  has  very  high  densities  of  saplings  that  are  <  15  cm  dbh 
(mostly  <  8  cm  dbh),  and  therefore,  has  higher  scores  than  sites  A  and  B  when 
these  variables  are  emphasized  (Figures  19  and  20).  However,  when  the 
structural  variables  are  considered  (Figures  18  and  21),  site  C  has  lower 
scores  than  sites  A  and  B.  This  has  been  interpreted  as  strong  evidence  that 
the  environmental  gradient  PC  1  is,  in  reality,  not  directly  representing 
habitat  characteristics  of  age  per  se,  but  rather  is  tracking  the  complexity 
of  vegetation  structure,  which  includes  vegetation  volume  as  an  important 
component.  (Of  course,  vegetation  complexity  and  volume  are  usually  associ¬ 
ated  with  successional  age.)  Except  for  a  few  ponds,  site  C  consists  almost 
entirely of two layers of vegetation: a ground cover of smooth brome grass
(Bromus inermis) and a uniform-aged stand of young black locust (Robinia
pseudoacacia). The use of all variables for the PCA clearly expresses this
vegetation complexity/volume gradient (Figure 21).

Structural  Variables  (See  Figure  18  and  Table  15) 

The relative relationships of the four forest sites are not easily
interpretable with this subset of variables, except in the case of bottomland
forests G and H along PC 4. The younger bottomland forest (G) has more
openings, and therefore has both a higher patchiness and a higher density of
low vegetation (1 to 3 m). Note also the location of the "shrubby sites" on
PC 4.

PC 2 represents the density of "midstory" vegetation (2 to 7.5 m). Site
C--the young black locust grove--ordinates strongly on PC 2. Site D--an older
and more complex black locust grove--and upland forests (I,J) also show good
representation of this vegetation layer. Early successional stages (A,B)
occur at the other end of this gradient.

PC 3 contrasts high ground cover of vegetation on the positive axis with
the presence of aquatic habitats on the negative axis.

Typical  Variables  (See  Figure  19  and  Table  16) 

The  first  two  principal  components  clearly  ordinate  the  study  sites  into 
four  categories:  open  habitats,  shrub/saplings,  bottomland  forests,  and  up¬ 
land  forests.  PC  2  represents  a  strong  gradient  of  shrub/sapling  density.  In 
the  case  of  the  forest  sites,  "shrubs"  represent  the  seedlings  of  canopy  spe¬ 
cies.  Note  that  bottomland  forests  and  early  successional  seres  have  low 
scores  on  PC  2. 

PC 3 represents the density of large trees (> 53 cm dbh). The older
bottomland forest, with fast-growing species such as cottonwoods (Populus
deltoides) and sycamores (Platanus occidentalis), has the highest scores on
this axis. At site I, there are some large representatives of these species
in the riparian zone.

PC  4  represents  a  gradient  of  high  ground  cover  combined  with  a  high 
species  richness  of  shrubs.  Sites  B  and  C  score  low  on  this  gradient  because 
of  the  presence  of  aquatic  habitats  as  well  as  a  poor  representation  of  shrub 
species. More than 97 percent of the shrubs at site C were black locust
sprouts or seedlings, while most of the shrubs at site B were willow clumps
(Salix interior) around ponds, with scattered rose (mainly Rosa multiflora,
some R. rugosa), and a few seedlings of black cherry (Prunus serotina),
cottonwood, sycamore, and box elder (Acer negundo).

Typical + Tree Guild Variables (See Figure 20 and Table 17)

The  addition  of  tree  guilds  or  species  (see  Appendices  E  and  F)  strongly 
separated  bottomland  and  upland  forests  on  PC  2.  In  comparison  to  the  pre¬ 
vious  analysis,  these  forests  are  also  more  clearly  separated  on  PC  1. 

PC  3  represents  high  shrub  cover  and  small  tree  (<  15  cm  dbh)  density 
along with high shrub heterogeneity (shrub patchiness).

PC  4  represents  the  basal  area  of  black  locust.  This  species  was  planted 
on sites C, D, and E. (Appendix D provides the ages of black locusts, taken
from tree-boring data, for these sites.) Overall, this analysis probably
overestimates the importance of tree species compositions in ordinating study
sites.


All  Variables  (See  Figure  21  and  Table  18) 


Use of all the variables provides the most informative ordination, as
well as the most realistic ecological interpretation. PC 1 accurately
ordinates the vegetation's relative complexity/volume. PC 2, the second most
important environmental gradient, represents the gradient of open habitats
(negative scores) to that of dense shrub cover and small trees (positive
scores).


PC  3  contrasts  environmental  characteristics  of  bottomland  forests  (very 
large  trees,  open  subcanopy,  and  bottomland  tree  species)  with  characteristics 
of  upland  forests  (dense  subcanopy  and  seedlings,  and  upland  tree  species). 
Note  the  gradient  of  decreasing  subcanopy  from  J-I-G-H  on  PC  3.  A  parallel 
decrease  in  vegetation  complexity/density  occurs  along  PC  1,  but  is  not  nearly 
as  striking. 

PC  4  represents  a  gradient  of  heavy  ground  cover  (positive)  versus  the 
presence  of  aquatic  habitats. 

Classification  Using  Cluster  Analysis 

The  classification  results  derived  from  cluster  analysis  depended  on  the 
specific  combinations  of  habitat  variables  used.  Figures  22  through  24  show, 
respectively,  the  results  obtained  by  using  the  best  subset  of  variables  for 
CAD  (Table  7),  the  best  subset  for  PCA  (Table  10),  and  the  complete  set  of 
habitat  variables  (Appendix  B).  The  use  of  all  58  habitat  variables  produced 
the  best  classification,  and  correctly  placed  most  of  the  individual  transects 
into  their  respective  study  sites.  Also,  this  analysis  immediately  separated 
the  impacted  sites  from  the  controls.  Since  the  clustering  algorithm 
spatially  relates  observations  (transects)  in  terms  of  arithmetic  distances  in 
the  n-dimensional  space  (axes)  of  n  predictor  (independent)  variables,  cluster 
linkage  occurs  relatively  late  when  many  environmental  variables  are  used; 
therefore,  transects  are  grouped  only  when  they  have  many  different  environ¬ 
mental  attributes  in  common.  Note  that  the  relative  length  of  the  linkage 
modes  increases  when  more  variables  are  used  (Figures  22  through  24). 
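The reason that linkage occurs later as more variables are used can be seen from the distance calculation itself. The sketch below assumes NumPy, Euclidean distance, and hypothetical random standardized data (12 transects and 58 variables are illustrative sizes); the clustering algorithm actually used in this study may differ.

```python
import numpy as np

def pairwise_distances(X):
    """Euclidean distance between every pair of rows (transects)."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2))

rng = np.random.default_rng(4)

# Hypothetical data: 12 transects described by 58 standardized variables.
Z = rng.normal(size=(12, 58))

d10 = pairwise_distances(Z[:, :10])   # first 10 variables only
d25 = pairwise_distances(Z[:, :25])   # first 25 variables
d58 = pairwise_distances(Z)           # all 58 variables

# Mean distance between distinct transects for each variable set.
mask = ~np.eye(12, dtype=bool)
mean_d = [d[mask].mean() for d in (d10, d25, d58)]
```

Each added variable contributes a nonnegative squared difference, so no pairwise distance can shrink when variables are added. Transects therefore join clusters at greater distances, and only pairs that agree on many attributes remain close.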

The  cluster  analyses  (Figures  22  through  24)  summarize  or  reveal  several 
interesting  features.  The  alluvial  control  and  the  high  desert  are  environ¬ 
mentally  very  similar.  Transects  VC1  and  VC2  are  very  different  from 
transects  VC3  and  VC4,  unless  shrub  species  compositions  are  taken  into 
account  (Figure  24).  VC3  and  VC4  are  almost  parallel  and  located  on  a  slight 
slope,  whereas  VC1  and  VC2  parallel  each  other  on  the  valley  bottom.  VC1 
paralleled  a  rocky  ridge  at  125  to  200  m.  Interestingly,  VC1  resembles  M2 
(Figure  23).  M2  is  low  on  an  alluvial,  and  Army  training  activities  have  made 
the substrate sandy. Transects S1 and S2 are very similar. These two
transects  are  located  in  the  most  severely  disturbed  portion  of  study  sites. 
Although  not  located  close  to  one  another,  S3  and  S6  share  common  environ¬ 
mental  features.  In  the  S  study  area,  shallow,  irregular  washes  were  present 
only in these two transects. These transects not only had patches of larger
shrubs,  but  increased  shrub  species  richness,  since  washes  in  desert  eco¬ 
systems  are  concentrated  sources  of  water  during  showers  or  the  rainy  seasons. 

Figures  25  through  28  show  the  cluster  analysis  results,  using  principal 
component  scores  as  predictor  variables.  Only  four  eigenvectors  were  rotated 
in the varimax solution of PCA. The use of all four principal components
(Figure 25) separated the high desert and the alluvial control, emphasized the
uniqueness of VC1, recognized the similarity of S1 and S2, and classified S3
and  S6  (the  least  impacted  transects  of  the  S  study  site)  with  the  moderately 
impacted  transects. 

The use of the first three principal component scores (Figure 26)
classified sites AC and L together, but in minor details (mainly the
relationships of VC3-VC4 and S1-S2) was similar to the classification
obtained with four principal component scores.

Use  of  the  first  two  principal  component  scores  (Figure  27),  which 
explain  70.1  percent  of  the  original  data  variance,  classified  the  high  desert 
with  the  moderately  impacted  alluvial.  Two  transects  from  the  moderately 
impacted  alluvial  (M2  and  M3)  were  grouped  with  transects  S3  and  S6. 

Figure  28  shows  the  classification  using  only  one  variable — PC  1  scores 
(PC  1  explained  48.2  percent  of  the  data  variance).  This  variable  represents 
a linear combination of original habitat variables that defines the
environmental impact gradient. Initially, three groups of transects are clustered:
(1) severely impacted, (2) controls, and (3) lightly impacted high desert/
moderately impacted alluvial (including one AC transect). A more detailed
clustering represents an excellent classification, particularly in terms of
environmental  disturbance.  Figure  28,  employing  only  a  single  variable  for 
clustering,  provides  information  similar  to  that  given  in  Figure  24  (58 
variables  used).  Also,  the  use  of  only  PC  1  scores  separates  AC  and  L  more 
effectively,  and  recognizes  the  close  environmental  similarity  of  S3  to  S6, 
VC3  to  VC4,  and  AC3  to  L  transects.  This  analysis  also  simplifies  the 
interpretation  of  VC1  by  classifying  it  with  three  other  control  transects. 
Figure  28  appears  to  represent  the  most  realistic  classification,  despite  the 
fact  that  only  a  single  predictor  variable  was  used. 
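
The idea of clustering transects on a single derived variable can be sketched in a few lines. The example below is only an illustration under stated assumptions: the PC 1 scores are invented, the transect labels are borrowed from the text purely as names, and single-linkage clustering is assumed because this section does not state which linkage rule was used. In one dimension, cutting a single-linkage dendrogram into k groups is the same as splitting the sorted scores at the k - 1 largest gaps.

```python
# Hypothetical PC 1 scores (invented values, not taken from the report).
pc1_scores = {
    "S1": -2.1, "S2": -1.9, "S4": -1.6,   # severely impacted
    "M2": -0.2, "M3": 0.1, "L1": 0.3,     # moderately / lightly impacted
    "VC3": 1.7, "VC4": 1.8, "AC1": 2.0,   # controls
}

def cluster_1d(scores, k):
    """Single-linkage clustering of 1-D scores into k groups.

    Sort the scores, then cut at the k - 1 largest gaps between neighbors.
    """
    ordered = sorted(scores.items(), key=lambda item: item[1])
    gaps = [(ordered[i + 1][1] - ordered[i][1], i)
            for i in range(len(ordered) - 1)]
    # Indices after which a cut is made, restored to left-to-right order.
    cut_after = sorted(i for _, i in sorted(gaps, reverse=True)[:k - 1])
    clusters, start = [], 0
    for i in cut_after:
        clusters.append([name for name, _ in ordered[start:i + 1]])
        start = i + 1
    clusters.append([name for name, _ in ordered[start:]])
    return clusters

clusters = cluster_1d(pc1_scores, k=3)
for group in clusters:
    print(group)
```

With these made-up scores the three groups line up with the disturbance classes, which is the kind of outcome the text reports for Figure 28; real data would of course need the actual PC 1 scores.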

A series of cluster analyses was also performed on varimax rotated
principal components solutions using the maximum possible number of
eigenvectors. Most researchers do this routinely. For these analyses, the best
subset of habitat variables for PCA was used (the same variables were used for
Figures 25 through 28), and the number of extracted eigenvectors equaled the
number of variables (a total of 25). Figures 29 through 32 show the results
of these analyses using the first principal component, then the first four,
the first 10, and the first 20, respectively. Note that as the number of
principal components used in the cluster analysis increases, it becomes more
difficult to form clusters. In other words, each transect becomes more
unique. Also, not only do relationships among transects become more obscure,
but the realism and ecological interpretation of the classification are lost.
Even the use of only one, or several, of the major principal components does
not produce a good classification. In Figure 29, the least and most impacted
transects at S (S1 and S6) are grouped, transects from AC, M, and L are
grouped in two different cluster sets, and transect S4 is identified as
unique. In Figure 30, transects from four of the study sites (M, L, VC, AC)
show ambiguous relationships.

Figures  33  through  36  show  the  cluster  analyses  obtained  by  using 
canonical  variate  scores.  Note  how  rapidly  all  the  experimental  transects  are 
correctly  classified  into  their  respective  study  sites.  Use  of  either  the 
first  four,  the  first  three,  or  the  first  two  canonical  variate  scores  gave 
essentially  identical  results.  Like  PC  1,  CV  1  represents  the  environmental 
impact  gradient,  but  cannot  distinguish  among  control  transects,  classifying 
alluvial  and  valley  controls  into  a  single  group. 
DISCUSSION AND DATA SUMMARY


The analytical description of environmental gradients has been shown to
be an eigenanalysis problem, and is a general solution of the well-known
principal components solution. In environmental impact assessment, this
gradient defines habitat disturbance and corresponds to principal component
1. Higher-order environmental gradients may be extracted that represent
other, possibly important, ecological attributes that are independent of
environmental impacts. If they are present, PC 2 and PC 3 and sometimes PC 4
represent, respectively, in decreasing importance, these other gradients.
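
As a rough illustration of the eigenanalysis described above, the sketch below extracts the largest eigenvector of a correlation matrix by power iteration and uses it to compute PC 1 scores. Everything specific here is an assumption made for the example: the three habitat variables and their values are invented, and power iteration is simply one convenient way to obtain the first eigenvector, not necessarily the routine used for the report.

```python
import math

# Hypothetical data: rows are transects; columns are shrub cover (%),
# litter cover (%), and bare sand (%). Values are invented for illustration.
habitat = [
    [30.0, 12.0, 20.0],
    [28.0, 10.0, 25.0],
    [20.0, 8.0, 40.0],
    [18.0, 7.0, 45.0],
    [8.0, 3.0, 70.0],
    [5.0, 2.0, 75.0],
]

def standardize(rows):
    """Convert every column to Z scores (mean 0, sample standard deviation 1)."""
    n = len(rows)
    z_columns = []
    for column in zip(*rows):
        mean = sum(column) / n
        sd = math.sqrt(sum((v - mean) ** 2 for v in column) / (n - 1))
        z_columns.append([(v - mean) / sd for v in column])
    return [list(row) for row in zip(*z_columns)]

def correlation_matrix(z_rows):
    """Correlation matrix computed from already standardized rows."""
    n, p = len(z_rows), len(z_rows[0])
    return [[sum(row[i] * row[j] for row in z_rows) / (n - 1)
             for j in range(p)] for i in range(p)]

def largest_eigenpair(matrix, iterations=200):
    """Power iteration: returns (largest eigenvalue, unit-length eigenvector)."""
    p = len(matrix)
    vector = [1.0 / (i + 1) for i in range(p)]  # arbitrary non-zero start
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(v * v for v in product))
        vector = [v / norm for v in product]
    product = [sum(matrix[i][j] * vector[j] for j in range(p)) for i in range(p)]
    eigenvalue = sum(vector[i] * product[i] for i in range(p))
    return eigenvalue, vector

z = standardize(habitat)
r = correlation_matrix(z)
eigenvalue, weights = largest_eigenpair(r)
explained = eigenvalue / len(r)  # the trace of a correlation matrix equals p
pc1_scores = [sum(w * v for w, v in zip(weights, row)) for row in z]

print("share of variance on PC 1:", round(explained, 3))
print("PC 1 scores:", [round(s, 2) for s in pc1_scores])
```

In this toy case the first component contrasts cover and litter against bare sand, so the PC 1 scores order the transects along a single disturbance-like axis. The sign of an eigenvector is arbitrary, so only the ordering and spacing of the scores carry meaning.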

Principal  components  analysis  was  robust  with  respect  to  variable 
combinations  selected,  number  of  variables,  sample  sizes,  and  data  matrix 
singularity  (row/column  linear  dependencies).  The  first  few  eigenvectors  were 
affected  very  little  by  whether  the  input  matrix  was  of  full  rank — an 
important  consideration  when  sample  sizes  are  small.  However,  PCA  analysis 
and  interpretation  benefit  by  having  large  ratios  of  sample  sizes  to  number  of 
variables.


PC  1  consistently  represented  the  environmental  impact  gradient  (Figures 
11  through  13  and  18  through  21).  Interestingly,  the  use  of  only  one 
variable — PC  1  (derived  from  25  original  variables) — classified  23  study 
transects  in  the  Mojave  Desert  as  well  as  all  58  of  the  original  habitat 
variables  (compare  Figures  28  and  24).  Therefore,  it  is  concluded  that  the 
statistical/ordination properties of PCA, along with its accepted popularity
in a wide variety of disciplines, and readily available software for
mainframe, mini- and microcomputers make PCA an excellent choice for
environmental impact assessments and parsimonious description of habitat structure.
Furthermore,  it  represents  an  analytical  framework  for  assessing 
species/habitat  associations  along  environmental  gradients  (63,  64). 

PCA  limitations  have  been  reviewed,  with  strong  emphasis  on  nonlinear 
relationships  among  environmental  variables  (or  their  subsets)  and  the 
position  of  study  sites  or  sampling  points  along  environmental  gradients  (100, 
5,  34,  35,  48,  37,  75,  113,  49,  36,  33).  Ordination  techniques  emphasizing 
nonlinear  responses  are  being  explored,  and  comparative  evaluations  will  be 
discussed in a separate USA-CERL Technical Report.


PCA  is  bound  by  a  number  of  statistical  assumptions  that  are  similar  to 
univariate  parametric  statistical  inferences  (71,  4,  90).  However,  in  most 
applications, PCA has been found to be robust to heteroscedasticity and
nonnormality  (44,  40,  41,  67,  personal  observation).  This  is  an  important 
consideration,  since  ecological  data  rarely  conform  to  parametric  assump¬ 
tions.  Good  evidence  of  robustness  was  that  PCA  reproduced  relatively  similar 
ordinations,  both  with  raw  data  and  transformed  data.  Variable  transforma¬ 
tions  vastly  improve  the  fit  of  raw  data  to  parametric  assumptions,  greatly 
reducing  skewness,  variance  heterogeneity,  and  nonlinearity  (96,  20a,  20, 
personal  observation).  Nevertheless,  all  data  were  transformed  before 
analysis  (See  Appendix  C  and  Chapter  4).  Cassie  and  Michael  (13),  Whittaker 
(113),  and  Pimentel  (78)  suggest  data  transformations  in  PCA. 

Whenever variables of different scales and measures (such as
environmental or habitat variables) are incorporated into PCA, variable standardization
(e.g., Z scores) was found to produce ecologically more relevant
ordinations. Therefore, correlation matrices were used for all PCA input. Since
principal components are not invariant under linear transformations, the
extracted eigenvectors of a correlation matrix are not equivalent to, nor can
they be derived from, those of a covariance matrix. Although researchers use
correlation matrices more frequently for PCA (17), covariance matrices are
theoretically more desirable because they have better statistical/mathematical
properties related to statistical inference, distribution theory, and
interpretations (3, 44, 71, 39, 17). Also, it may be more difficult to
interpret the coefficients given by the eigenvectors (90). These arguments
have been assessed carefully, and the practicality and advantages of variable
standardization, especially since primary interest is in the first principal
component, far outweigh theoretical arguments and mathematical rigor.
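
The practical effect of standardization can be seen in a small numerical sketch. The two variables and their values below are hypothetical; the point is only that a covariance matrix of raw data is dominated by whichever variable has the largest units, while the covariance matrix of Z scores is the correlation matrix, in which every variable carries unit variance.

```python
import math

# Hypothetical data: shrub cover (%) and shrub density (plants per hectare).
raw = [[30.0, 1200.0], [28.0, 1500.0], [20.0, 900.0],
       [18.0, 1100.0], [8.0, 400.0], [5.0, 300.0]]

def column_means(rows):
    n = len(rows)
    return [sum(row[j] for row in rows) / n for j in range(len(rows[0]))]

def covariance_matrix(rows):
    """Sample covariance matrix (divisor n - 1)."""
    n, p = len(rows), len(rows[0])
    means = column_means(rows)
    return [[sum((row[i] - means[i]) * (row[j] - means[j]) for row in rows) / (n - 1)
             for j in range(p)] for i in range(p)]

def z_scores(rows):
    """Standardize each column to mean 0 and sample standard deviation 1."""
    means = column_means(rows)
    cov = covariance_matrix(rows)
    sds = [math.sqrt(cov[j][j]) for j in range(len(means))]
    return [[(row[j] - means[j]) / sds[j] for j in range(len(means))] for row in rows]

cov_raw = covariance_matrix(raw)
cov_z = covariance_matrix(z_scores(raw))

# Share of total raw variance contributed by the density variable alone.
density_share = cov_raw[1][1] / (cov_raw[0][0] + cov_raw[1][1])
# Pearson correlation computed directly from the raw covariance matrix.
pearson_r = cov_raw[0][1] / math.sqrt(cov_raw[0][0] * cov_raw[1][1])

print("density share of raw variance:", round(density_share, 4))
print("covariance of Z scores:", [[round(v, 3) for v in row] for row in cov_z])
print("Pearson r:", round(pearson_r, 3))
```

Here nearly all of the raw variance belongs to the density column, so a covariance-based first component would be little more than density itself; after standardization both variables contribute equally.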

The varimax criterion (59) for orthogonal rotation of principal component
solutions was used in all PCA to simplify factor structure (43). This
technique enhances the ecological interpretation of PC 2, PC 3, and PC 4.
Only the first four principal components were used for varimax rotation, since
the use of high-order transformation matrices resulted in rotated solutions
for which factor pattern matrices (principal component loadings) were
difficult to interpret in terms of the original predictor variable set. This was
the direct consequence of the very small eigenvalues associated with
higher-order principal components. (See Chapter 5 for more explicit details.)

Canonical analysis of discriminance proved very useful for exploratory
data analysis and for selecting habitat variables that were useful for
distinguishing study sites (Figures 10 and 14 through 17). Use of at least
the first two canonical variates accurately classified all transects into
their respective study sites in a desert ecosystem (Figure 35). Use of only
the first canonical variate, which represented the environmental impact
gradient (optimal CAD variable subset), could not distinguish between the two
control study sites (Figures 36 and 10). The addition of canonical variate 2,
representing a gradient of habitat characteristics that contrasted alluvials
to valleys, was necessary for the complete discrimination of study sites
(Figure 10). CAD was also used successfully to classify the Mojave Desert
study sites on the basis of bird or small mammal community structure (63, 64).

This report has shown that the results of canonical analyses were
strongly dependent on which combination of variables was selected for analysis
(Figures 5 through 10). Canonical analysis with various combinations or
subsets of variables was very sensitive to small sample sizes. Furthermore,
discriminant analyses are very sensitive to linear dependency among variables
(e.g., values of a given variable cannot be similar to linear combinations of
other variables, a feature difficult to avoid with highly correlated
environmental variables and small sample sizes). These difficulties of CAD are
independent of the scaling, accuracy, precision, or dispersion of collected data
values. However, appropriate variable transformations are beneficial, because
they reduce heteroscedasticity of covariance matrices. The use of large
sample sizes with respect to the number of environmental variables measured is
highly recommended, since it minimizes the usual problems encountered with
CAD.
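
The linear-dependency problem can be made concrete with a short check. The sketch below is an assumption-laden illustration, not the tolerance test used in the report: it builds two small correlation matrices from invented variables and compares their determinants. When one variable is an exact linear combination of the others (here a hypothetical "total cover" equal to shrub plus grass cover), the matrix is singular, its determinant is zero to within rounding, and it cannot be inverted as discriminant analysis requires.

```python
import math

def correlation_matrix(columns):
    """Pearson correlation matrix for a list of equal-length variable columns."""
    n = len(columns[0])
    means = [sum(c) / n for c in columns]
    devs = [[v - m for v in c] for c, m in zip(columns, means)]
    norms = [math.sqrt(sum(d * d for d in dev)) for dev in devs]
    p = len(columns)
    return [[sum(a * b for a, b in zip(devs[i], devs[j])) / (norms[i] * norms[j])
             for j in range(p)] for i in range(p)]

def determinant(matrix, tiny=1e-12):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    size, det = len(m), 1.0
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < tiny:
            return 0.0  # numerically singular
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return det

# Hypothetical cover values on six transects (invented for illustration).
shrub = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
grass = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]
litter = [5.0, 3.0, 6.0, 1.0, 2.0, 4.0]
total = [s + g for s, g in zip(shrub, grass)]  # exact linear combination

det_independent = determinant(correlation_matrix([shrub, grass, litter]))
det_dependent = determinant(correlation_matrix([shrub, grass, total]))

print("determinant without dependency:", round(det_independent, 4))
print("determinant with dependency:   ", det_dependent)
```

A determinant near zero is a warning sign rather than a complete diagnostic; in practice near-dependencies among highly correlated variables cause the same instability in milder form.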

Cluster analysis was not as effective as PCA and CAD in consistently and
accurately classifying the Mojave Desert study transects. PCA and CAD
ordinations persistently associated individual transects with their respective
study sites. This was not surprising, since these ordinations were derived by
weighting the original variables according to a "recipe" that can uncover
underlying patterns (if they exist). As long as relevant ecological patterns
are present (e.g., impact or succession gradients), techniques such as PCA or
CAD will be superior to clustering analysis. Recall that cluster analysis was
most effective when using variables derived from other multivariate
techniques, especially CAD. R-analysis (the clustering of variables on the basis
of samples) is recommended as an excellent technique for initial exploratory
data analysis, reducing data dimensionality, identifying independent variable
subsets, and eliminating redundant variables.
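
A minimal sketch of R-mode screening follows. It is an illustration under assumptions, not the procedure used in the report: the four variables and their values are invented, absolute Pearson correlation is assumed as the similarity measure, and variables are merged by single linkage whenever |r| reaches an arbitrary threshold of 0.9. Variables that land in the same group are candidates for redundancy.

```python
import math

# Hypothetical habitat variables measured on six transects (invented values).
variables = {
    "shrub_cover": [30.0, 28.0, 20.0, 18.0, 8.0, 5.0],
    "litter": [12.0, 10.0, 8.0, 7.0, 3.0, 2.0],
    "sand": [20.0, 25.0, 40.0, 45.0, 70.0, 75.0],
    "rock": [5.0, 9.0, 4.0, 10.0, 6.0, 8.0],
}

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def r_mode_groups(data, threshold=0.9):
    """Group variables whose |r| meets the threshold (single linkage)."""
    names = list(data)
    parent = list(range(len(names)))

    def find(i):
        while parent[i] != i:
            i = parent[i]
        return i

    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(pearson(data[names[i]], data[names[j]])) >= threshold:
                parent[find(j)] = find(i)  # merge the two groups

    groups = {}
    for i, name in enumerate(names):
        groups.setdefault(find(i), []).append(name)
    return list(groups.values())

groups = r_mode_groups(variables)
for group in groups:
    print(group)
```

In this toy data set shrub cover, litter, and sand form one tightly correlated group while rock stands alone. Whether to drop or keep members of a correlated group remains an ecological judgment, as the conclusions note.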
7 CONCLUSIONS


The  concept  of  environmental  gradient  analysis  has  been  demonstrated  to 
be  a  parsimonious  and  rigorous  methodology  for  quantifying  environmental 
impact  assessments  or  successional  seres.  The  analysis  has  potential  for 
analytically describing a broad range of environmental or ecological
phenomena.

The analytical description of environmental gradients has been shown to
be an eigenanalysis problem, mathematically equivalent to the largest
eigenvector (or first principal component) of a principal components analysis. The
second or third, and sometimes the fourth, principal components may also have
relevant environmental/ecological information. Specific suggestions have been
made to optimize the ecological relevance and interpretation of principal
components ordinations in terms of environmental gradients.

The analytical representation of an environmental gradient, itself a
single  independent  (predictor)  variable,  had  environmental  information  similar 
to  that  in  the  combination  of  all  original  58  habitat  variables  that  described 
five  different  study  sites  at  Fort  Irwin.  The  basis  of  this  finding  is  that 
each  produced  a  similar  and  accurate  classification  of  the  23  Mojave  Desert 
study  transects. 

Principal  components  ordinations  were  robust  with  respect  to  sample 
sizes, variable choice, number of variables, data matrix singularity,
heteroscedasticity, and nonnormality. Therefore, principal components analyses
represent  an  excellent  analytical  basis  for  environmental  impact  assessments 
and  parsimonious  description  of  habitat  structure,  as  well  as  assessing 
species-habitat  associations  along  environmental  gradients. 

Ecological ordinations with discriminant analyses have been shown to be
sensitive to sample sizes, the choice of descriptive variables, and linear
dependency among variables. Although preliminary variable screening using
bivariate correlations and cluster analysis (R-mode) to eliminate variable
redundancy is highly recommended, ecological realism is often enhanced by
incorporating several highly intracorrelated variables to quantify an
environmental attribute. Canonical analysis of discriminance may be a useful
mechanism for choosing among variable subsets. Despite these analytical approaches
for variable selection or rejection, it is ultimately the experience of the
researcher and his/her familiarity with the environmental/ecological process
being modeled that plays the major role in selecting the environmental
variables for data analysis. Variable choice, as well as ordination sensitivity,
becomes less significant when the ratio of sample sizes to number of variables
increases. Interestingly, this property is independent of the variance,
precision, or accuracy of the collected data, but is a fundamental property of
the matrix mechanics inherent in multivariate analyses. Therefore, variables
must be chosen with care and ecological insight, and adequate sample sizes are
mandatory.
situations. The specific nature of the environmental/ecological
investigation, the desired research or interpretive goals, the resources available or
committed to data collection or analyses, and the experience of the researcher
all play major roles in ordination and classification technologies.


[Figure 2 diagram labels: SAMPLING; SCALE 1 cm = 2 m; Major reference axis of study site]

Figure  2.  Schematic  representation  of  the  26-m-diameter  sampling  quadrat 
located  within  each  Illinois  sampling  unit  showing  the  relative 
orientation  of  the  vegetation  "hits"  transects.  (At  point  "A,"  the 
presence/absence  of  ground  cover,  canopy  cover,  level  topography, 
and  aquatic  habitat  was  recorded.  At  point  "B,"  the  same  data  were 
collected,  with  the  addition  of  the  presence/absence  of  vegetation 
in  11  vertical  height  categories.) 
Table 1


Pearson  Correlation  of  the  First  Four  Canonical  Variates  With  the 
Mojave  Desert  Habitat  Variables  That  Passed  the  Tolerance  Test 


CV  1 

CV  2 

CCB 

0.84 

CBW 

0.88 

CREST 

-0.512

CRASS 

0.582 

FORB 

LIT 

-0.622 

CRAV 

-0.92 

CSAND 

0.641 

0.79 

SAND 

ROCK 

-0.73 

CL4 

0.77 

CVDCB 

-0.89 

CVDBW 

CVDR 

1 

o 

'■'J 

CT- 

CVHCB 

CN 

oo 

O 

1 

-0.89 

CVHBW 

CVHR 

CVNCB 

0.532 

P  <  0.0001 

1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 


Table  2 

Pearson  Correlation  of  the  First  Four  Canonical  Variates  With  the 
"Basic"  Subset  of  Mojave  Desert  Habitat  Variables 


CV  1 

CV  2 

CV  3 

CV  4 

CCB 

O  O 

OJ  OO 

0.641 

CBW 

CREST 

0.56z 

CRASS 

FORB 

LIT 

0.78 

0.562 

0.512

CRAV 

CSAND 

0.91 

0.88 

SAND 

ROCK 

-0.94 

-0.542 

P  <  0.0001 


1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 
Table 3


Pearson Correlation of the First Four Canonical Variates With the
"Basic" + Shrub Height Subset of Mojave Desert Habitat Variables


CV  1 

CV  2 

CCB 

0.91 

CBW 

CREST 

0.86 

CRASS 

0.512

FORB 

0.592

LIT 

0 . 52^ 

CRAV 

-0.76 

0.691 

0.572

CSAND 

0.592 

SAND 

ROCK 

1 

O 

SO 

ro 

L2 

-0.522 

L4J 

0.73 

L8 

0.552

L10 

0.73 

L12J 

0.92 

CL4 

0.83 

P  <  0.0001 

CV  4 


-0.51* 


1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 


Table  4 

Pearson  Correlation  of  the  First  Four  Canonical  Variates  With  the 
"Basic"  +  Shrub  Size  Heterogeneity  Subset  of  Mojave  Desert 

Habitat  Variables 


CV  1 

CV  2 

CCB 

0.91 

CBW 

0.82 

0.512

CREST 

CRASS 

o 

o 

0.64 

FORB 

LIT 

CRAV 

0.7  1 

-0.65 

CSAND 

0.74 

SAND 

1 

o 

vO 

*. r 

ROCK 

CVDCB 

-0.85 

CVDBW 

-0.78 

P  <  0.0001 

1  0.0001  <  P  <  0.001 


2  0.001  <  P  <  0.01 
Table 5


Pearson Correlation of the First Four Canonical Variates With the
Habitat Variables From Table 4 + Cover of Shrubs > 1.5 m in Height


CCB

CV  1 
0.90 

CBW 

0.81 

CREST 

0.51 

CRASS 

0.64 

FORB 

0.72 

LIT 

CRAV 

0.73 

CSAND 

SAND 

-0.95 

ROCK 

CVDCB 

-0.84 

CVDBW 

CL4 

0.77 

CV  2  CV  3 

0.641 

-0.631 

0.76 

-0.77 


P  <  0.0001 

1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 


CV  4 
Table 6

Pearson Correlation of the First Four Canonical Variates With the
Habitat Variables From Table 5 + Mean Heights of Dominant and

Subdominant  Shrub  Species 


CV  1 

CV  2 

CCB 

0.87 

CBW 

CREST 

0.88 

CRASS 

-0.62' 

FORB 

0.522 

0.68 

LIT 

CRAV 

0.92 

CSAND 

0.76 

SAND 

ROCK 

-0.78 

1 

O 

-"J 

CVDCB 

CT' 

O 

1 

CVDBW 

0.72 

CL4 

0.80 

HCB 

0.92 

HBW 

HREST 

0.83 

P  <  0.0001 

1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 



Table  7 


Pearson  Correlation  of  the  First  Four  Canonical  Variates  With  the 
Habitat Variables From Table 6 + Density of Shrubs
1.1  to  <  1.5  m  in  Height 


CCB 

CBW 

CREST 

CRASS 

FORB

LIT 

CRAV 

CSAND 

SAND 

ROCK 

CVDCB 

CVDBW 

L12J 

CL4 

HCB 

HBW 

HREST 


0.58, 

0.532 

-0.90 


P  <  0.0001 


1  0.0001  <  P  <  0.001 


2  0.001  <  P  <  0.01 


Table  8 


Pearson Correlation of the First Four Principal Components With the
Mojave Desert Habitat Variables That Formed the Optimal Subset
for CAD (Table 7 Variables)


CCB 

PC  1 
0.78 

CBW 

0.81 

CREST 

CRASS 

FORB 

LIT 

0.70 

GRAV 

CSAND 

0.88 

SAND 

-0.65 

ROCK 

CVDCB 

-0.89 

CVDBW 

L12J 

0.74 

CL4 

0.69 

HCB 

0.82 

HBW 

0.89 

HREST 

PC  2  PC  3 

0.82, 

0.552

0.90 

0.86 

-0.562 

0.641 

0.91 


P  <  0.0001 

1  0.0001  <  P  <  0.001 


With  the 
Subset 

PC  4 

-0.582 

0.87 

0.74 


2  0.001  <  P  <  0.01 


Table  9 


Pearson  Correlation  of  the  First  Four  Principal  Components  With 
the  Complete  Set  of  Mojave  Desert  Habitat  Variables 


CCB 

CBW 

CCHE 

CSH 

CTB 

CMT 

CCAS 

CMD 

CA 

CCD 

C123 

CREST 

CL1 

CL2 

CL3 

CL4

HCB 

HBW 

HCHE 

HSH 

HTB 

HMT 

HCAS 

HMD 

HA 

HCD 

H123

HREST 

CRASS 

FORB 

LIT 

CRAV 

CSAND 

SAND 

ROCK 

CVDCB 

CVDBW 

CVDR 

CVHCB 

CVHBW 

CVHR 

CVNCB 

CVNBW 

CVNR 

L2 

L4 

L6 

L8 

L10 

L12 

L14

L16 

L18 

L20 

L22 

L27 

SHCOV 

NSP 


PC  1 
0.94 
0.89 

0.75 


PC  2 


0.75 

0.592 


0.701 


0.81 

0.92 

0.89 

0.84 

0.90 

0.83 

0.77 


0.622 

0.701 


0.73 

0.522 


0.671 

0.552

0.522 

0.62^ 

0.701 

-0.81 

-0.88 


0.592 


0.631 
0.612


PC  3  PC  4 


0.691 

0.80 

0.71 


0.622 


0.572

0.77 

0.81 


0.631 

0.68^ 

0.57^ 

-0.622 


-0.90 


0.701 


-0.54 


2 


0.74 

0 .88. 


5  3 
72 
87 
93 
74, 


0.69 


0.74 


0.651 


0.98 


0.89 


P  <  0.0001 

1  0.0001  <  P  <  0.001 


2  0.001  <  P  <  0.01 
Table 10


Pearson  Correlation  of  the  First  Four  Principal  Components  With  the 
Mojave  Desert  Habitat  Variables  That  Formed  the  Optimal 

Subset  for  PCA 


PC  1 

PC  2 

PC  3 

PC 4

CCB 

0.93 

CBW 

0.87 

1 

CREST 

0.641 

CRASS 

-0.75 

FORB 

0.72

LIT 

0.622 

9 

CRAV 

0.62z 

CSAND 

0.75 

SAND 

-0.80 

9 

ROCK 

0.60^ 

CVDCB 

-0.88 

CVDBW 

0.86 

CVDR 

CVHCB 

-0.90 

CVHBW 

0.85 

CVHR 

9 

CVNCB 

-0.55^ 

9 

CVNBW 

0.58; 

-0. 

59 

CVNR 

0.612

L2 

0. 

81 

L4J 

0.82 

L8 

L10 

0.681 

0.57^ 

0.562 

L12J 

0.92 

CL4 

0.85 

P  <  0.0001 

1  0.0001  <  P  <  0.001 


2  0.001  <  P  <  0.01 
Figure 14. Ordination of Illinois study sites in canonical space based on the
structural  subset  of  habitat  variables.  (See  Table  11.  Study 
sites  are  identified  in  Appendix  D.  The  loci  represent  mean 
canonical  variate  scores.) 


Table  11 


Pearson Correlation of the First Four Canonical Variates With the
Structural  Subset  of  Illinois  Habitat  Variables 


CV  1 

CV  2 

CV  3 

CV  4 

L1

-0. 162 

L2 

-0.23 

-0.47 

0.46 

L3 

0.27 

0.22 

0.35 

L4 

0.38 

0.41 

0.28 

0.39 

L5 

0.36 

0.37 

0.48 

0.26 

0.19?; 

L6 

0.48 

o 

o 

0.38 

L7 

0.62 

0.49 

0 . 14" 

L8 

0.71 

0.50 

-0.21 

-0.181 

L9 

0.81 

0.47 

L10 

0.91 

0.21 

L11

0.98 

-0. 142 

VEGVOL

0.63 

0.29 

0. 142 

VHW 

0.76 

0.50 

0  .  1 9  1 

HZHW 

0.38 

0.24 

GC 

-0.  30 

CC 

0.71 

0.49 

0.132 

0.30 

0.20 

AQUA 

-0.55, 

TOP 

-0.39 

0.37 

0.  14" 

P  <  0.0001 

1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 


Table  12 

Pearson Correlation of the First Four Canonical Variates With the
Typical Subset of Illinois Habitat Variables


CV1 

CV2 

CV3 

-0.181 

CV4 

GC 

-0.29 

0.20 

0.27 

CC 

0.75 

-0.43 

0.29 

SHRD 

0.132 

-0.132 

0.34 

0.49 

SHRH 

0.32 

0.152 

0.35 

SHRC 

0.20 

0.51 

NSPSHR 

0.46 

0.50 

-0. 181 

0.48 

CANAV 

0.92 

-0.27 

CANMX 

0.93 

-0.162 

NSPTRE 

0.81 

0. 132 

-0. 171 

SAP 

0.35 

-0.50 

0.61 

A 

0.58 

-0.32 

0.28 

0.55 

B 

0.83 

0.29 

0.18 

C 

0.85 

D 

0.79 

-0.17 

E 

0.76 

-0.24 
-0. 142 

-0.27 

F 

0.47 

-0.23 

G 

0.33 

0.201 

-0.16' 

H 

I 

TREMAS 

0.92 

-0.23 

0.15' 

P  <  0.0001 
1 0.0001 < P < 0.001
2 0.001 < P < 0.01


Table  13 


Pearson Correlation of the First Four Canonical Variates With the
Typical + Tree Guilds Subsets of Illinois Habitat Variables

CV  1 

CV  2 

CV  3 

CV  4 

GC 

-0.30 

0.181 

CC 

0.60 

0.39 

-0.50 

SHRD 

0. 152 

-0. 181 

0.56 

SHRH 

-0. 142 

-0.142 

0.40 

SHRC 

0.22 

—0 . 1 3  2 

0.43 

NSPSHR 

0.35 

0.33 

0.25 

0.36 

CANAV 

0.81 

0.35 

-0.30 

-0 . 16  2 

CANMX 

0.78 

0.46 

-0.25 

NSPTRE 

0.71 

0.35 

0.22 

SAP 

0.30 

-0.49 

A 

0.41 

0.42 

-0.48 

0.24 

B 

0.67 

0.50 

-0. 142 

0.30 

C 

0.76 

0.34 

0.162 

D 

0.75 

0.21 

-0.181 

E 

0.73 

0.22 

F 

0.37 

0.41 

-0.28 

G 

0.26 

0.34 

-0.22 

H 

I 

0. 181 

-0. 152 

TREMAS

0.  78 

0.43 

-0.29 

DUP 

0.86 

■0.42 

0. 142 

DBOT 

0.27 

0.72 

-0.29 

DRICH 

0.72 

■0.  132 

DEAR 

0.13 

0.31 

0.34 

0.142 

DDEA 

0.44 

0.35 

0.35 

DBL 

-0. 35 

-0.85 
-0 . 1 7 1 

0 . 1  7  1 

DC 

0.69 

DELM 

0.45 

0.57 

-0.27 

P < 0.0001

1 0.0001 < P < 0.001


Table  14 


Pearson  Correlation  of  the  First  Four  Canonical  Variates  With  the 
Complete  Set  of  Illinois  Habitat  Variables 


CV  1 

CV  2 

CV  3 

CV  4 

L1

-0.16;? 

L2 

— 0 . 1 3  2 

-0.22 

0.51 

L3 

0.28 

0.27 

L4 

0.32 

0.49 

L5 

0.28 

0. 142 

0.49 

-0.25 

-0.18* 

L6 

0.36 

0.57 

L7 

0.48 

0.30 

0.48 

0.17* 
0. 132 

L8 

0.57 

0.44 

0.39 

L9 

0.66 

0.48 

0.37 

L10 

0.77 

0.44 

0.24 

-0. 142 

L11

0.91 

0.28 

VEGVOL 

0.54 

0.25 

0.29 

VHW 

0.62 

0.29 

0.52 

HZHW 

0.36 
-0 . 1 9  * 

0. 132 

CC 

-0.29 

CC 

0.58 

0.23 

0.57 

-0. 152 

AQUA 

TOP 

0.33 
-0.21 
-0. 191 

-0 . 1  7  1 

-0.35 

SHRD 

-0.17? 

0.191 

0.27 

0.19* 

0.44 

SHRH 

-0 . 1 7 1 

0.31 

SHRC 

0.20 

0.34 

NSPSHR 

0.34 

0.32 

0.41 

-0.19* 

CANAV 

0.81 

0.24 

0.37 

CANMX 

0.77 

0.35 

0.39 

0.19* 

NSPTRE 

0.  70 

0.28 

0.20 

SAP 

0.28 

0.43 

-0.25 

A 

0.38 

0.26 

0.60 

B 

0.65 

0.40 

0.36 

0.27 

C 

0.74 

0.26 

0.18* 

0.25 

0.162 

D 

0.75 

E 

0.74 

0.21 

F 

0.40 

0.42 

-0.20 

G 

H 

I 

TREMAS 

0.28 

0. 191 

0.35, 

0.132 

-0. 162 

0.77 

0.32 

0.41 

0.18* 

DUP 

0.84 

-0.44 

DBOT 

0.29 

0.69 

-0.24 

DRICH 

0.71 

M  — * 

m  ct\ 

o  o 

DEAR 

0.14^ 

0.35 

-0.21 

DDEA 

0.42 

0.27 

0.28 

0.26 

DBL 

-0.38 

-0.26 

0.72 

-0.31 

DC 

0.62 

0.32 

-0. 1 62 

DELM 

0.47 

0.59 

P  <  0.0001 


1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 
Figure 18. Ordination of Illinois study sites in principal components space
based on the structural variable subset of habitat variables.
(See Table 15. Study sites are identified in Appendix D. The
loci represent mean principal components scores. A* represents
study site A after the sapling edge was deleted from the
analysis.)
Figure 19. Ordination of Illinois study sites in principal components space
based  on  the  typical  variable  subset  of  habitat  variables.  (See 
Table  16.  Study  sites  are  identified  in  Appendix  D.  The  loci 
represent  mean  principal  components  scores.  A*  represents  study 
site  A  after  the  sapling  edge  was  deleted  from  the  analysis.) 
Figure 20. Ordination of Illinois study sites in principal components space
based on the typical + tree guilds subset of habitat variables.
(See Table 17. Study sites are identified in Appendix D. The
loci represent mean principal components scores. A* represents
study site A after the sapling edge was deleted from the
analysis.)
Figure 21. Ordination of Illinois study sites in principal components space
based  on  the  complete  set  of  habitat  variables.  (See  Table  18. 
Study  sites  are  identified  in  Appendix  D.  The  loci  represent  mean 
principal  components  scores.  A*  represents  study  site  A  after  the 
sapling  edge  was  deleted  from  the  analysis.) 


Table  15 


Pearson  Correlation  of  the  First  Four  Principal  Components  With  the 
Structural  Subset  of  Illinois  Habitat  Variables,  and  Summary 
of  the  Important  Associations  of  Habitat  Variables  With  the 
First Four Principal Components


PC  1 

PC  2 

PC  3 

PC  4 

L1

0.93 

0.15 

L2 

-0.20 

0.32 

0.67 

L3 

0. 142 

0.64 

0. 38 

L4 

0.21 

0.85 

0.20 

L5 

0.20 

0.86 

L6 

0.42 

0.76 

L7 

0.71 

0.43 

L8 

0.86 

0.23 

0 . 1 9  1 

0. 1 7 1 

0.13' 

L9 

0.92 

L10 

0.92 

L11

0.84 

VEGVOL

0.65 

0.49 

0.48 

0.15' 

VHW 

0.73 

0.64 

0.20 

HZHW 

-0. 142 

0.26 

-0.21 

0.69 

CC 

0.91 

CC 

0.62 

0.67 
-0.  181 

AQUA 

-0.71 

0.25 

TOP 

-0.36 

P  <  0.0001 


1  0.0001  <  P  <  0.001 


PC  1  + 


2 0.001 < P < 0.01

Vegetation  layers  >  6.5  m  especially  8.5  m 

High vegetation volume

High  canopy  cover 

High  vertical  heterogeneity 


PC 2 +


Vegetation  layers  1-2  m 
High  ground  cover 

Vegetation  layers  2  to  7.5  m 
High  vegetation  volume 
High  canopy  cover 
High  vertical  heterogeneity 
High  horizontal  heterogeneity 


PC  3  + 


Vegetation  layers  <  2  m  especially  <  1  m 
High  ground  cover 
High  vegetation  volume 


Presence  of  water 


Vegetation  layers  1  to  !  m 
High  horizontal  heterogeneity 


Table 16

Pearson  Correlation  of  the  First  Four  Principal  Components  With  the 
Typical  Subset  of  Illinois  Habitat  Variables,  and  Summary 
of  the  Important  Associations  of  Habitat  Variables 
With  the  First  Four  Principal  Components 


PC  1 

PC  2 

PC  3 

PC  4 

cc 

-0.21 

0. 76 

cc 

0.83 

0.30 

SHRD 

0.92 

SHRH 

-0.16* 

0.  191 

0.85 

SHRC 

0.90 

NSPSHR 

0.46 

0.71 

CANAV 

0.89 

0.22 

CANMX 

0.92 

0.21 

NSPTRE 

0.84 

0.28 

0.19 

SAP 

0.49 

0.55 

A 

0.71 

0.38 

B 

0.86 

C 

0.86 

D 

0.77 

— 0 . 1 9 1 

E 

0.70 

0.29 

F 

0.41 

-0.21 

0.53 

C 

0.22 

0.77 

H 

0.51 

I 

0.59 

0. 15‘ 

TREMAS 

0.94 

0.25 

PC 1 +


PC 2 +


PC 3 +


PC 4 +


P  <  0.0001 

1  0.0001  <  P  <  0.001 

2  0.001  <  P  <  0.01 
High  basal  area  of  trees 

High  canopy  cover,  average  and  maximum  canopy  heights 
High  density  of  trees  8  to  53  cm  dbh 
Many  tree  species 

High  ground  cover 
High  shrub  heterogeneity 

High  shrub  density,  cover,  and  heterogeneity 
High  density  of  trees  <  15  cm  dbh 

High  density  of  trees  38  to  68  cm  dbh 

High  density  of  trees  >  53  cm  dbh 

High  ground  cover 
Many  shrub  species 
Table 17


Pearson Correlation of the First Four Principal Components With the
Typical + Tree Guilds Illinois Habitat Variables, and Summary
of the Important Associations of Habitat Variables
With the First Four Principal Components


PC  1 

PC  2 

PC  3 

PC  4 

cc 

-0.35 

0.26 

cc 

0.76 

0.30 

0.35 

0.26 

SHRD 

0. 152 
-0. 181 

-0.28 

0.85 

SHRH 

-0. 162 

0.76 

SHRC 

0.21 

-0.24 

0.84 

NSPSHR 

0.28 

0.21 

0.39 

-0.47 

CANAV 

0.87 

0.31 

0.  .71 

CANMX 

0.84 

0.38 

NSPTRE 

0.75 

0.21 

0.42 

-0.25 

SAP 

0.47 

0.56 

0.35 

A 

0.59 

0.30 

0.51 

0.26 

B 

0.74 

0.32 

0.24 

-0. 132 

C 

0.80 

0.21 

-0.17¹

D 

0.79 

-0.20 

E 

0.74 

0.22 

-0.21 

-0.19¹

F 

0.33 

0.18¹

0.59 

-0.20 

G

0.62 

H 

0.15²

0.33

-0.15²

I 

0.25 

TREMAS 

0.89 

0.40 

0.16²

DUP 

0.78 

-0.44 

-0.22 

DBOT 

DRICH 

0.26 

0.65

0.14²

0.80

-0.19¹

-0.41 

DEAR 

0.42

-0.50 

DDEA 

0.49 

0.26 

DBL 

0.24 

0.38

DC 

0.76 

0.18¹

DELM 

0.37 

0.52 

0.14²

-0.37 

P < 0.0001

¹ 0.0001 < P < 0.001

² 0.001 < P < 0.01

PC  1  +  High  basal  area  of  trees 

High  canopy  cover,  average  and  maximum  canopy  height 
High  density  of  trees  8  to  53  cm  dbh 
Many  tree  species 

High  dominance  of  upland  tree  species 
Snags 


High  ground  cover 

High  shrub  heterogeneity 

High  density  of  trees  >  53  cm  dbh 

High  dominance  of  bottomland  tree  species 


PC 2 +


Table  17  (Cont'd) 


High  dominance  of  upland  tree  species 


PC 3 +


High  shrub  density,  cover,  and  heterogeneity 

High  density  of  trees  <  15  cm  dbh 

High  dominance  of  early  succession  tree  species 


PC 4 +


High dominance of black locust


High  dominance  of  early  succession  and  upland  tree  species 
Many  shrub  species 


Table  18 


Pearson Correlation of the First Four Principal Components With the
Complete Set of Illinois Habitat Variables and Summary of the
Important Associations of Habitat Variables With the
First Four Principal Components


PC  1 

PC  2 

PC  3 

PC  4 

L1

0.93 

L2 

-0.47 

0.39 

L3 

0.29 

0.56 

-0.23 

0.15 

L4 

0.26 

0.18¹

0.78 

L5 

0.77 

0. 15l 

L6 

0.31 

0.76 

L7 

0.49 

0.57 

0.25 

L8 

0.65 

0.43 

0.27 

L9 

0.74 

0.37 

0.29 

L10 

0.80 

0.25

0.36 

L11

0.88 

0.27 

0.17¹

0.15²

-0.13²

VEGVOL

0.55 

0.55 

0.48 

VHW 

0.62 

0.72 

HZHW 

0.45 

GC

-0.22 

0.92 

CC 

0.54 

0.73 

0.21 

0.13²

AQUA 

TOP 

0.17¹

-0.20 

-0.59 

SHRD 

0.48 

-0.62 

0.21 

SHRH 

0.36 

-0.48 

0.17 

SHRC 

0.24 

0.50 

-0.58 

0.21 

NSPSHR 

0.51 

0.38 

CANAV 

0.76 

0.42 

0.31 

CANMX 

0.78 

0.42 

0.31 

NSPTRE 

0.78 

0.43 

SAP 

0.21 

0.77 

A 

0.39 

0.75 

0.14²

B 

0.73 

0.38 

0.20 

0.18¹

C 

0.80 

0.19¹

D 

0.80 

E 

0.74

0.26 

F 

0.39 

0.56 

G

0.25 

0.17¹

0.53 

H 

0.33 

I 

0.19¹
Table 18 (Cont'd)


PC  1 

PC  2 

PC  3 

TREMAS 

0.77 

0.48 

0.32 

DUP 

0.75 

-0.38 

DBOT 

0.32 

0.72 

DRICH 

0.72 

0. 1?1 

0.19¹

-0.24

DEAR 

0.34 

-0.16 

DDEA

0.47 

0.18 

DBL 

-0.51 

0.60 

DC 

0.20 

0.17¹

0.69 

DELM 

0.52 

0.34 

P < 0.0001

¹ 0.0001 < P < 0.001

² 0.001 < P < 0.01


PC  1 


PC  2 


PC  3 


PC  4 


Vegetation  layers  >  6.5  m,  especially  >  8.5  m 

High  vegetation  volume  and  basal  area  of  trees 

High  canopy  cover,  average  and  maximum  canopy  heights 

High  vertical  heterogeneity 

High  density  of  trees  15  to  53  cm  dbh 

Many  tree  species 

High  dominance  of  upland  tree  species 
Snags 

High  ground  cover 

High  dominance  of  black  locust 

Vegetation  layers  2  to  7.5  m 

High  vegetation  volume 

High  canopy  cover 

High  vertical  heterogeneity 

High  density  of  trees  <  15  cm  dbh 

High  dominance  of  black  locust 

High  shrub  density,  cover,  and  heterogeneity 

High  horizontal  heterogeneity  of  vegetation 

Vegetation  layers  >  6.5  m 

Scarcity  of  vegetation  layers  <  5  m 

High  density  of  trees  >  53  cm  dbh 

High  dominance  of  bottomland  tree  species 

Vegetation layers 1 to 3 m

High dominance of upland and early succession tree species
High  shrub  density,  cover,  and  heterogeneity 

Vegetation  layers  <  2  m,  especially  <  1  m 
High  ground  cover 

High  dominance  of  early  succession  tree  species 
High  vegetation  volume 


Presence  of  aquatic  habitats 
Figure 29. Cluster analysis of the Mojave Desert transects using the first
principal  component  from  a  PCA  where  the  maximum  number  of 
eigenvectors  was  extracted. 
APPENDIX B


MOJAVE  HABITAT  VARIABLES 


Number   Code        Habitat Variable

1        CCB         cover of Larrea tridentata (creosote bush) (m²/ha)
2        CBW         cover of Ambrosia dumosa (burroweed) (m²/ha)
3        CCHE        cover of Hymenoclea salsola (m²/ha)
4        CSH         cover of Grayia spinosa + Lycium spp. (m²/ha)
5        CTB         cover of Thamnosma montana (m²/ha)
6        CMT         cover of Ephedra spp. (m²/ha)
7        CCAS        cover of Cassia armata (m²/ha)
8        CUD         cover of Psorothamnus arborescens (m²/ha)
9        CA          cover of Acamptopappus sphaerocephalus (m²/ha)
10       CCD         cover of Salazaria mexicana + Eriogonum fasciculatum (m²/ha)
11       C123        cover of Amphipappus fremontii + Haplopappus cooperi + Encelia frutescens (m²/ha)
12       CREST       cover of CCHE to C123 combined (subdominant shrubs) (m²/ha)
13       HCB         mean height of Larrea tridentata (m)
14-24    HBW-HREST   mean height of HBW-HREST (m)
25       CL1         cover of shrubs < 0.5 m in height (m²/ha)
26       CL2         cover of shrubs > 0.5 m to < 1 m in height (m²/ha)
27       CL3         cover of shrubs > 1 m to < 1.5 m in height (m²/ha)
28       CL4         cover of shrubs > 1.5 m in height (m²/ha)
29       GRASS       grass cover (%)
30       FORB        forb cover (%)
31       LIT         litter cover (%)
32       SAND        sand (particle sizes < 3 mm) (%)
33       CSAND       coarse sand (particle sizes > 3 mm to < 1 cm) (%)
34       GRAV        gravel (particle sizes > 1 to < 8 cm) (%)
35       ROCK        rock (particle sizes > 8 cm) (%)
36       CVDCB       coefficient of variation, diameter of creosote bush
37       CVDBW       coefficient of variation, diameter of burroweed
38       CVDR        coefficient of variation, diameter of subdominant shrubs
39       CVHCB       coefficient of variation, height of creosote bush
40       CVHBW       coefficient of variation, height of burroweed
41       CVHR        coefficient of variation, height of subdominant shrubs
42       CVNCB       coefficient of variation, density of creosote bush
43       CVNBW       coefficient of variation, density of burroweed
44       CVNR        coefficient of variation, density of subdominant shrubs
45       L2          density of shrubs ≤ 0.2 m in height (shrubs/1600 m²)
46       L4          density of shrubs 0.3 - 0.4 m in height (shrubs/1600 m²)
47       L6          density of shrubs 0.5 - 0.6 m in height (shrubs/1600 m²)
48       L8          density of shrubs 0.7 - 0.8 m in height (shrubs/1600 m²)
49       L10         density of shrubs 0.9 - 1.0 m in height (shrubs/1600 m²)
50       L12         density of shrubs 1.1 - 1.2 m in height (shrubs/1600 m²)
51       L14         density of shrubs 1.3 - 1.4 m in height (shrubs/1600 m²)
52       L16         density of shrubs 1.5 - 1.6 m in height (shrubs/1600 m²)
53       L18         density of shrubs 1.7 - 1.8 m in height (shrubs/1600 m²)
54       L20         density of shrubs 1.9 - 2.0 m in height (shrubs/1600 m²)
55       L22         density of shrubs 2.1 - 2.2 m in height (shrubs/1600 m²)
56       L27         density of shrubs > 2.2 m in height (shrubs/1600 m²)
57       SHCOV       total shrub cover (m²/ha)
58       NSP         number of shrub species (N)
59       L4J*        L4 + L6 (shrubs/1600 m²)
60       L12J*       L12 + L14 (shrubs/1600 m²)

*Variables created after initial data analysis because of very high correlations.
APPENDIX C


VARIABLE  TRANSFORMATIONS  USED  FOR 
MOJAVE  DESERT  HABITAT  VARIABLES 


Variable  (X) 
Habitat  variables: 


Transformation  of  Variable  X 


CCB to CREST and L2 to SHCOV (see Appendix B)


loge(X + 1)


HCB  to  HREST  (see  Appendix  B) 


loge(100X + 1)


Shrub  diameter  and  height 


loge(100X) 


Shrub  density 


Ground cover and substrate size


arcsine  p 


Horizontal  heterogeneity  CVDCB  to  CVNR 


loge(100X) 


(see  Appendix  B) 


Population  estimates  for  mammals 


loge(X + 1)


Population  estimates  for  birds 


(X + 0.5)


Biomass 


loge(X)


¹Variable X is expressed as a proportion p (decimal). Since the arcsine
transformation is unavailable with many statistical packages, an equivalent
form was used:


$$\arctan\left(\frac{p^{1/2}}{(1 - p)^{1/2}}\right)$$
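
The equivalence noted in the footnote can be checked numerically. The sketch below assumes that the intended transformation is the usual angular form arcsin(sqrt(p)) and that the arctangent expression is arctan(sqrt(p) / sqrt(1 - p)); the function names and test proportions are hypothetical and are not taken from the report.

```python
import math

def arcsine_sqrt(p):
    """Angular transformation arcsin(sqrt(p)) for a proportion p in [0, 1]."""
    return math.asin(math.sqrt(p))

def arcsine_sqrt_via_arctan(p):
    """Same quantity using arctan only; p = 1 is handled as the limit pi/2."""
    if p >= 1.0:
        return math.pi / 2
    return math.atan(math.sqrt(p) / math.sqrt(1.0 - p))

# Hypothetical proportions (e.g., ground cover expressed as decimals).
proportions = [0.0, 0.05, 0.25, 0.5, 0.9, 1.0]
max_gap = max(
    abs(arcsine_sqrt(p) - arcsine_sqrt_via_arctan(p)) for p in proportions
)
print(max_gap)  # largest disagreement between the two forms
```

The explicit branch for p = 1 is a design choice of this sketch: the arctangent form divides by zero there, whereas the arcsine form does not.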
APPENDIX E


ILLINOIS  HABITAT  VARIABLES 


Variable   Type of Variable*

L1         S      Vegetation hits 0 - < 1 m
L2         S      Vegetation hits 1 - < 2 m
L3         S      Vegetation hits 2 - < 3 m
L4         S      Vegetation hits 3 - < 4 m
L5         S      Vegetation hits 4 - < 5 m
L6         S      Vegetation hits 5 - < 6.5 m
L7         S      Vegetation hits 6.5 - < 7.5 m
L8         S      Vegetation hits 7.5 - < 8.5 m
L9         S      Vegetation hits 8.5 - < 10 m
L10        S      Vegetation hits 10 - < 15 m
L11        S      Vegetation hits > 15 m
VEGVOL     S      Total vegetation hits
VHW        S      Vertical heterogeneity within quadrat
HZHW       S      Horizontal heterogeneity within quadrat
GC         S,T    Ground cover (%)
CC         S,T    Canopy cover (%)
AQUA       S      Aquatic hits (%)
TOP        S      Level ground (%)
SHRD       T      Shrub density (stems/0.1 ha)
SHRH       T      Shrub heterogeneity
SHRC       T      Shrub cover (m²/0.1 ha)
NSPSHR     T      Number of shrub species in the 16 individuals measured
CANAV      T      Average height of dominant vegetation (m)
CANMX      T      Highest vertical projection of vegetation (m)
NSPTRE     T      Number of tree species found in the 26-m quadrat
SAP        T      Density of trees 3 to < 8 cm dbh (number/ha)
A          T      Density of trees 8 to < 15 cm dbh (number/ha)
B          T      Density of trees 15 to < 22 cm dbh (number/ha)
C          T      Density of trees 22 to < 30 cm dbh (number/ha)
D          T      Density of trees 30 to < 38 cm dbh (number/ha)
E          T      Density of trees 38 to < 53 cm dbh (number/ha)
F          T      Density of trees 53 to < 68 cm dbh (number/ha)
G          T      Density of trees 68 to < 84 cm dbh (number/ha)
H          T      Density of trees 84 cm to < 1 m dbh (number/ha)
I          T      Density of trees > 1 m dbh (number/ha)
TREMAS     T      Total basal area of trees (m²/0.053 ha)
DUP        G      Basal area of upland tree species (m²/0.053 ha)
DBOT       G      Basal area of bottomland tree species (m²/0.053 ha)
DRICH      G      Basal area of rich/moist soil tree species (m²/0.053 ha)
DEAR       G      Basal area of early succession tree species (m²/0.053 ha)
DDEA       G      Basal area of snags (m²/0.053 ha)
DBL        G      Basal area of black locust (m²/0.053 ha)
DC         G      Basal area of cottonwood (m²/0.053 ha)
DELM       G      Basal area of American and slippery elms (m²/0.053 ha)

*S = structural; T = typical; G = tree guilds.
ILLINOIS TREE SPECIES, SCIENTIFIC NAMES,
AND GUILD COMPOSITIONS


Cosmopolitan

Black locust              Robinia pseudoacacia
Cottonwood                Populus deltoides
American elm              Ulmus americana
Siberian elm              Ulmus pumila
Slippery elm              Ulmus rubra

Upland Species

White ash                 Fraxinus americana
Basswood                  Tilia americana
Flowering dogwood         Cornus florida
Shagbark hickory          Carya ovata
Sweet pignut hickory
Hop hornbeam              Ostrya virginiana
Sugar maple               Acer saccharum
Red mulberry              Morus rubra
White mulberry            Morus alba
Black oak                 Quercus velutina
Chinquapin oak            Quercus muehlenbergii
White oak                 Quercus alba
Blackhaw                  Viburnum prunifolium

Bottomland Species

European black alder      Alnus glutinosa
Green ash                 Fraxinus pennsylvanica
Bald cypress              Taxodium distichum
Burning bush              Euonymus atropurpureus
Gray dogwood              Cornus racemosa
Rough-leafed dogwood      Cornus drummondii
Sweetgum                  Liquidambar styraciflua
Box elder                 Acer negundo
Silver maple              Acer saccharinum
Bur oak                   Quercus macrocarpa
Swamp white oak           Quercus bicolor
Sycamore                  Platanus occidentalis
Willow                    Salix spp.

Moist/Rich Soil Species

Ohio buckeye              Aesculus glabra
Catalpa                   Catalpa speciosa
Kentucky coffee tree      Gymnocladus dioicus
Hackberry                 Celtis occidentalis
Bitternut hickory         Carya cordiformis
Ironwood                  Carpinus caroliniana
Nannyberry                Viburnum lentago
Red oak                   Quercus rubra
Shingle oak               Quercus imbricaria
Paw paw                   Asimina triloba
Northern prickly ash      Zanthoxylum americanum
Redbud                    Cercis canadensis
Black walnut              Juglans nigra
Butternut                 Juglans cinerea

Early Succession Species

Eastern red cedar         Juniperus virginiana
Black cherry              Prunus serotina
Crabapple                 Malus spp.
Hawthorn                  Crataegus spp.
Honey locust              Gleditsia triacanthos
Osage orange              Maclura pomifera
Peach                     Prunus persica
Plum                      Prunus americana
Sassafras                 Sassafras albidum
Smooth sumac              Rhus glabra
APPENDIX G:


CHARACTERIZATION OF DISPERSION
AND  COVARIANCE  MATRICES 


The  dispersion  matrix  consists  of  elements  of  the  form: 


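$$\sum_{i=1}^{N} \left(X_{\alpha i} - \bar{X}_{\alpha}\right)\left(X_{\beta i} - \bar{X}_{\beta}\right) \qquad \text{[Eq G1]}$$

$$\alpha = 1, 2, \ldots, p; \qquad \beta = 1, 2, \ldots, p$$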


This  matrix,  which  is  also  called  the  sums  of  squares  and  cross  products 
matrix  (SSCP),  gives  information  about  the  scatter  of  p  variates  around  their 
means  with  a  sample  size  N. 


However, the magnitude of the elements is a function of sample size: each
element is a sum over N samples, so it tends to grow as N increases. Dividing
by sample size N gives each element an "average dispersion," and the resulting
matrix, called a variance-covariance (or more simply covariance) matrix, is
independent of sample size.


Since α = β along the diagonal of the matrix, diagonal elements will be:


$$\frac{1}{N} \sum_{i=1}^{N} \left(X_{\alpha i} - \bar{X}_{\alpha}\right)^{2}$$


Since this term is recognized as the variance, a covariance matrix consists of
p variance components along its diagonal, and (p² - p)/2 distinct covariances
as off-diagonal elements.
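
As a minimal sketch of the definitions in this appendix (not code from the report), the following pure-Python functions build the sums of squares and cross products (SSCP) matrix and divide it by N to obtain the covariance matrix. The function names and the four-sample data set are hypothetical, chosen only for illustration. Many statistical packages divide by N - 1 rather than N, so their output may differ slightly from this sketch.

```python
def sscp_matrix(rows):
    """Sums of squares and cross products about the variable means.

    rows: list of N samples, each a list of p variate values.
    """
    n = len(rows)
    p = len(rows[0])
    means = [sum(row[a] for row in rows) / n for a in range(p)]
    return [
        [
            sum((row[a] - means[a]) * (row[b] - means[b]) for row in rows)
            for b in range(p)
        ]
        for a in range(p)
    ]

def covariance_matrix(rows):
    """SSCP matrix divided by the sample size N, as described in the text."""
    n = len(rows)
    return [[value / n for value in line] for line in sscp_matrix(rows)]

# Hypothetical data: N = 4 samples of p = 3 variates.
samples = [
    [1.0, 2.0, 0.5],
    [2.0, 1.0, 1.5],
    [3.0, 4.0, 2.5],
    [4.0, 3.0, 3.5],
]
cov = covariance_matrix(samples)
p = len(cov)
# Count of distinct off-diagonal covariances, (p^2 - p) / 2.
n_distinct_covariances = (p * p - p) // 2
print(cov[0][0], n_distinct_covariances)
```

The diagonal entries of `cov` are the variances of the individual variates, and the matrix is symmetric, which is why only (p² - p)/2 of the off-diagonal entries are distinct.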